Dataset info
| Number of variables | 181 |
|---|---|
| Number of observations | 20000 |
| Missing cells | 2408219 (66.5%) |
| Duplicate rows | 0 (0.0%) |
| Total size in memory | 27.2 MiB |
| Average record size in memory | 1.4 KiB |
Variables types
| Numeric | 54 |
|---|---|
| Categorical | 23 |
| Boolean | 19 |
| Date | 0 |
| URL | 0 |
| Text (Unique) | 1 |
| Rejected | 84 |
| Unsupported | 0 |
Warnings
coligada_mais_antiga_ativa has 17302 (86.5%) missing values | Missing |
coligada_mais_antiga_baixada has 19990 (> 99.9%) missing values | Missing |
coligada_mais_nova_ativa has 17302 (86.5%) missing values | Missing |
coligada_mais_nova_baixada is highly correlated with coligada_mais_antiga_ativa (ρ = 0.94271) | Rejected |
de_faixa_faturamento_estimado has 1154 (5.8%) missing values | Missing |
de_faixa_faturamento_estimado_grupo has 1154 (5.8%) missing values | Missing |
de_indicador_telefone has constant value "BOA" | Rejected |
de_nivel_atividade has 448 (2.2%) missing values | Missing |
de_saude_rescencia has 608 (3.0%) missing values | Missing |
de_saude_tributaria has 608 (3.0%) missing values | Missing |
dt_situacao only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
dt_situacao has a high cardinality: 4215 distinct values | Warning |
empsetorcensitariofaixarendapopulacao has 6268 (31.3%) missing values | Missing |
faturamento_est_coligados is highly correlated with coligada_mais_nova_baixada (ρ = 0.91204) | Rejected |
faturamento_est_coligados_gp is highly correlated with faturamento_est_coligados (ρ = 0.99793) | Rejected |
fl_epp has constant value "False" | Rejected |
fl_optante_simei has 3538 (17.7%) missing values | Missing |
fl_optante_simples has 3538 (17.7%) missing values | Missing |
grau_instrucao_macro_analfabeto has 19930 (99.7%) missing values | Missing |
grau_instrucao_macro_desconhecido has constant value "nan" | Rejected |
grau_instrucao_macro_escolaridade_fundamental has 18909 (94.5%) missing values | Missing |
grau_instrucao_macro_escolaridade_media is highly skewed (γ1 = 27.975) | Skewed |
grau_instrucao_macro_escolaridade_media has 17060 (85.3%) missing values | Missing |
grau_instrucao_macro_escolaridade_superior has 19058 (95.3%) missing values | Missing |
idade_acima_de_58 has 19524 (97.6%) missing values | Missing |
idade_ate_18 has 19888 (99.4%) missing values | Missing |
idade_de_19_a_23 has 18870 (94.3%) missing values | Missing |
idade_de_24_a_28 has 18368 (91.8%) missing values | Missing |
idade_de_29_a_33 has 18292 (91.5%) missing values | Missing |
idade_de_34_a_38 is highly correlated with idade_de_29_a_33 (ρ = 0.96376) | Rejected |
idade_de_39_a_43 is highly correlated with idade_de_34_a_38 (ρ = 0.95304) | Rejected |
idade_de_44_a_48 is highly correlated with idade_de_39_a_43 (ρ = 0.92955) | Rejected |
idade_de_49_a_53 is highly correlated with idade_de_44_a_48 (ρ = 0.9679) | Rejected |
idade_de_54_a_58 is highly correlated with idade_de_49_a_53 (ρ = 0.94691) | Rejected |
idade_maxima_coligadas is highly correlated with coligada_mais_nova_baixada (ρ = 0.94271) | Rejected |
idade_maxima_socios has 6586 (32.9%) missing values | Missing |
idade_media_coligadas is highly correlated with coligada_mais_nova_baixada (ρ = 0.97597) | Rejected |
idade_media_coligadas_ativas is highly correlated with idade_media_coligadas (ρ = 0.99868) | Rejected |
idade_media_coligadas_baixadas is highly correlated with idade_media_coligadas_ativas (ρ = 0.92832) | Rejected |
idade_media_socios is highly correlated with idade_maxima_socios (ρ = 0.95975) | Rejected |
idade_minima_coligadas is highly correlated with coligada_mais_nova_ativa (ρ = 0.99882) | Rejected |
idade_minima_socios is highly correlated with idade_media_socios (ρ = 0.95683) | Rejected |
max_faturamento_est_coligados is highly correlated with faturamento_est_coligados_gp (ρ = 0.94776) | Rejected |
max_faturamento_est_coligados_gp is highly correlated with max_faturamento_est_coligados (ρ = 0.99411) | Rejected |
max_filiais_coligados is highly correlated with coligada_mais_nova_baixada (ρ = 0.97946) | Rejected |
max_funcionarios_coligados_gp is highly correlated with coligada_mais_nova_baixada (ρ = 0.9016) | Rejected |
max_meses_servicos is highly correlated with idade_media_coligadas_baixadas (ρ = 0.97658) | Rejected |
max_meses_servicos_all has 15515 (77.6%) missing values | Missing |
max_vl_folha_coligados is highly correlated with max_filiais_coligados (ρ = 0.91799) | Rejected |
max_vl_folha_coligados_gp is highly correlated with max_vl_folha_coligados (ρ = 0.9144) | Rejected |
media_faturamento_est_coligados has 17332 (86.7%) missing values | Missing |
media_faturamento_est_coligados_gp is highly correlated with media_faturamento_est_coligados (ρ = 0.98196) | Rejected |
media_filiais_coligados is highly correlated with coligada_mais_nova_baixada (ρ = 0.98273) | Rejected |
media_funcionarios_coligados_gp is highly correlated with coligada_mais_nova_baixada (ρ = 0.93916) | Rejected |
media_meses_servicos is highly correlated with idade_media_coligadas_baixadas (ρ = 0.94844) | Rejected |
media_meses_servicos_all is highly skewed (γ1 = 23.838) | Skewed |
media_meses_servicos_all has 15515 (77.6%) missing values | Missing |
media_vl_folha_coligados has 18485 (92.4%) missing values | Missing |
media_vl_folha_coligados_gp has 18477 (92.4%) missing values | Missing |
meses_ultima_contratacaco is highly skewed (γ1 = 34.023) | Skewed |
meses_ultima_contratacaco has 15515 (77.6%) missing values | Missing |
min_faturamento_est_coligados is highly skewed (γ1 = 29.558) | Skewed |
min_faturamento_est_coligados has 17332 (86.7%) missing values | Missing |
min_faturamento_est_coligados_gp is highly skewed (γ1 = 40.941) | Skewed |
min_faturamento_est_coligados_gp has 17332 (86.7%) missing values | Missing |
min_filiais_coligados has 19201 (96.0%) missing values | Missing |
min_funcionarios_coligados_gp is highly skewed (γ1 = 28.164) | Skewed |
min_funcionarios_coligados_gp has 577 (2.9%) zeros | Zeros |
min_funcionarios_coligados_gp has 18262 (91.3%) missing values | Missing |
min_meses_servicos is highly skewed (γ1 = 35.627) | Skewed |
min_meses_servicos has 16703 (83.5%) missing values | Missing |
min_meses_servicos_all is highly correlated with meses_ultima_contratacaco (ρ = 0.9754) | Rejected |
min_vl_folha_coligados has 18485 (92.4%) missing values | Missing |
min_vl_folha_coligados_gp is highly correlated with min_funcionarios_coligados_gp (ρ = 0.93663) | Rejected |
nm_divisao has a high cardinality: 86 distinct values | Warning |
nm_meso_regiao has 2505 (12.5%) missing values | Missing |
nm_micro_regiao has a high cardinality: 74 distinct values | Warning |
nm_micro_regiao has 2505 (12.5%) missing values | Missing |
nu_meses_rescencia has 1905 (9.5%) missing values | Missing |
percent_func_genero_fem has 961 (4.8%) zeros | Zeros |
percent_func_genero_fem has 16721 (83.6%) missing values | Missing |
percent_func_genero_masc has 755 (3.8%) zeros | Zeros |
percent_func_genero_masc has 16721 (83.6%) missing values | Missing |
qt_admitidos is highly skewed (γ1 = 25.07) | Skewed |
qt_admitidos has 15515 (77.6%) missing values | Missing |
qt_admitidos_12meses is highly skewed (γ1 = 27.343) | Skewed |
qt_admitidos_12meses has 3209 (16.0%) zeros | Zeros |
qt_admitidos_12meses has 15515 (77.6%) missing values | Missing |
qt_alteracao_socio_180d has constant value "nan" | Rejected |
qt_alteracao_socio_365d has constant value "nan" | Rejected |
qt_alteracao_socio_90d has constant value "nan" | Rejected |
qt_alteracao_socio_total has constant value "nan" | Rejected |
qt_art is highly correlated with idade_media_coligadas_baixadas (ρ = 1) | Rejected |
qt_coligadas is highly skewed (γ1 = 20.236) | Skewed |
qt_coligadas has 17889 (89.4%) missing values | Missing |
qt_coligados is highly correlated with qt_coligadas (ρ = 0.99975) | Rejected |
qt_coligados_agropecuaria has 2533 (12.7%) zeros | Zeros |
qt_coligados_agropecuaria has 17290 (86.5%) missing values | Missing |
qt_coligados_atividade_alto has constant value "0.0" | Rejected |
qt_coligados_atividade_baixo has constant value "0.0" | Rejected |
qt_coligados_atividade_inativo has constant value "0.0" | Rejected |
qt_coligados_atividade_medio has constant value "0.0" | Rejected |
qt_coligados_atividade_mt_baixo has constant value "0.0" | Rejected |
qt_coligados_ativo is highly correlated with qt_coligados (ρ = 0.99782) | Rejected |
qt_coligados_baixada has 17290 (86.5%) missing values | Missing |
qt_coligados_ccivil is highly skewed (γ1 = 26.521) | Skewed |
qt_coligados_ccivil has 2251 (11.3%) zeros | Zeros |
qt_coligados_ccivil has 17290 (86.5%) missing values | Missing |
qt_coligados_centro has 2549 (12.7%) zeros | Zeros |
qt_coligados_centro has 17290 (86.5%) missing values | Missing |
qt_coligados_comercio has 1422 (7.1%) zeros | Zeros |
qt_coligados_comercio has 17290 (86.5%) missing values | Missing |
qt_coligados_epp has 17290 (86.5%) missing values | Missing |
qt_coligados_exterior has 17290 (86.5%) missing values | Missing |
qt_coligados_inapta is highly correlated with coligada_mais_antiga_baixada (ρ = 0.937) | Rejected |
qt_coligados_industria is highly skewed (γ1 = 25.232) | Skewed |
qt_coligados_industria has 2293 (11.5%) zeros | Zeros |
qt_coligados_industria has 17290 (86.5%) missing values | Missing |
qt_coligados_ltda is highly skewed (γ1 = 24.669) | Skewed |
qt_coligados_ltda has 2587 (12.9%) zeros | Zeros |
qt_coligados_ltda has 17290 (86.5%) missing values | Missing |
qt_coligados_matriz is highly correlated with qt_coligados_ativo (ρ = 0.99805) | Rejected |
qt_coligados_me has 17290 (86.5%) missing values | Missing |
qt_coligados_mei has 17290 (86.5%) missing values | Missing |
qt_coligados_nordeste has 1043 (5.2%) zeros | Zeros |
qt_coligados_nordeste has 17290 (86.5%) missing values | Missing |
qt_coligados_norte has 1813 (9.1%) zeros | Zeros |
qt_coligados_norte has 17290 (86.5%) missing values | Missing |
qt_coligados_nula has constant value "0.0" | Rejected |
qt_coligados_sa is highly correlated with idade_media_coligadas_baixadas (ρ = 0.91043) | Rejected |
qt_coligados_serviço has 1000 (5.0%) zeros | Zeros |
qt_coligados_serviço has 17290 (86.5%) missing values | Missing |
qt_coligados_sudeste is highly correlated with qt_coligados_matriz (ρ = 0.95437) | Rejected |
qt_coligados_sul is highly correlated with coligada_mais_nova_baixada (ρ = 0.91104) | Rejected |
qt_coligados_suspensa has 17290 (86.5%) missing values | Missing |
qt_desligados is highly correlated with qt_admitidos (ρ = 0.93477) | Rejected |
qt_desligados_12meses is highly skewed (γ1 = 35.828) | Skewed |
qt_desligados_12meses has 3181 (15.9%) zeros | Zeros |
qt_desligados_12meses has 15515 (77.6%) missing values | Missing |
qt_ex_funcionarios is highly correlated with qt_desligados (ρ = 1) | Rejected |
qt_filiais is highly skewed (γ1 = 20.326) | Skewed |
qt_filiais has 18181 (90.9%) zeros | Zeros |
qt_funcionarios is highly correlated with idade_de_54_a_58 (ρ = 0.9349) | Rejected |
qt_funcionarios_12meses is highly correlated with qt_funcionarios (ρ = 0.99486) | Rejected |
qt_funcionarios_24meses is highly correlated with qt_funcionarios_12meses (ρ = 0.97492) | Rejected |
qt_funcionarios_coligados is highly correlated with coligada_mais_nova_baixada (ρ = 0.94351) | Rejected |
qt_funcionarios_coligados_gp is highly correlated with max_funcionarios_coligados_gp (ρ = 0.99611) | Rejected |
qt_funcionarios_grupo has 1208 (6.0%) zeros | Zeros |
qt_funcionarios_grupo has 15052 (75.3%) missing values | Missing |
qt_ramos_coligados is highly correlated with idade_media_coligadas_baixadas (ρ = 0.91284) | Rejected |
qt_regioes_coligados has 17290 (86.5%) missing values | Missing |
qt_socios is highly correlated with idade_media_coligadas_baixadas (ρ = 0.93688) | Rejected |
qt_socios_coligados has 17290 (86.5%) missing values | Missing |
qt_socios_feminino has 13768 (68.8%) missing values | Missing |
qt_socios_masculino is highly correlated with qt_socios (ρ = 0.97996) | Rejected |
qt_socios_pep is highly correlated with qt_socios_masculino (ρ = 0.99002) | Rejected |
qt_socios_pf is highly correlated with qt_socios_pep (ρ = 0.98016) | Rejected |
qt_socios_pj has 14854 (74.3%) zeros | Zeros |
qt_socios_pj has 4959 (24.8%) missing values | Missing |
qt_socios_pj_ativos is highly correlated with qt_socios_pj (ρ = 0.96123) | Rejected |
qt_socios_pj_baixados has 19813 (99.1%) missing values | Missing |
qt_socios_pj_inaptos has 19813 (99.1%) missing values | Missing |
qt_socios_pj_nulos has 19813 (99.1%) missing values | Missing |
qt_socios_pj_suspensos has constant value "0.0" | Rejected |
qt_socios_st_regular is highly correlated with qt_socios_pf (ρ = 0.98725) | Rejected |
qt_socios_st_suspensa is highly correlated with min_filiais_coligados (ρ = 1) | Rejected |
qt_ufs_coligados is highly correlated with idade_media_coligadas_baixadas (ρ = 0.92173) | Rejected |
sum_faturamento_estimado_coligadas is highly correlated with max_vl_folha_coligados (ρ = 0.9227) | Rejected |
total is highly correlated with qt_funcionarios_24meses (ρ = 0.96062) | Rejected |
total_filiais_coligados is highly correlated with max_vl_folha_coligados (ρ = 0.92829) | Rejected |
tx_crescimento_12meses is highly skewed (γ1 = 49.472) | Skewed |
tx_crescimento_12meses has 1933 (9.7%) zeros | Zeros |
tx_crescimento_12meses has 16819 (84.1%) missing values | Missing |
tx_crescimento_24meses is highly skewed (γ1 = 37.678) | Skewed |
tx_crescimento_24meses has 1185 (5.9%) zeros | Zeros |
tx_crescimento_24meses has 16772 (83.9%) missing values | Missing |
tx_rotatividade has 3489 (17.4%) zeros | Zeros |
tx_rotatividade has 15515 (77.6%) missing values | Missing |
vl_faturamento_estimado_aux is highly correlated with total (ρ = 0.97315) | Rejected |
vl_faturamento_estimado_grupo_aux is highly skewed (γ1 = 29.359) | Skewed |
vl_faturamento_estimado_grupo_aux has 1154 (5.8%) missing values | Missing |
vl_folha_coligados is highly correlated with total_filiais_coligados (ρ = 0.91586) | Rejected |
vl_folha_coligados_gp is highly correlated with vl_folha_coligados (ρ = 0.922) | Rejected |
vl_frota is highly correlated with qt_socios_pj_inaptos (ρ = 0.9575) | Rejected |
vl_idade_maxima_socios_pj is highly correlated with idade_media_coligadas_baixadas (ρ = 0.98969) | Rejected |
vl_idade_media_socios_pj is highly correlated with vl_idade_maxima_socios_pj (ρ = 0.95057) | Rejected |
vl_idade_minima_socios_pj is highly correlated with vl_idade_media_socios_pj (ρ = 0.91241) | Rejected |
vl_potenc_cons_oleo_gas has 19845 (99.2%) missing values | Missing |
vl_total_tancagem is highly correlated with vl_potenc_cons_oleo_gas (ρ = 0.99476) | Rejected |
vl_total_tancagem_grupo is highly correlated with vl_potenc_cons_oleo_gas (ρ = 0.90065) | Rejected |
vl_total_veiculos_antt is highly correlated with vl_potenc_cons_oleo_gas (ρ = 0.99317) | Rejected |
vl_total_veiculos_antt_grupo is highly correlated with vl_total_veiculos_antt (ρ = 1) | Rejected |
vl_total_veiculos_leves is highly correlated with vl_total_veiculos_antt_grupo (ρ = 0.9581) | Rejected |
vl_total_veiculos_leves_grupo is highly correlated with vl_total_veiculos_antt (ρ = 0.95917) | Rejected |
vl_total_veiculos_pesados is highly correlated with vl_total_veiculos_antt_grupo (ρ = 0.97758) | Rejected |
vl_total_veiculos_pesados_grupo is highly correlated with vl_total_veiculos_antt (ρ = 0.97729) | Rejected |
coligada_mais_antiga_ativa
Numeric
| Distinct count | 2256 |
|---|---|
| Unique (%) | 11.3% |
| Missing (%) | 86.5% |
| Missing (n) | 17302 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 220.21 |
|---|---|
| Minimum | 0.6 |
| Maximum | 903.17 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 0.6 |
|---|---|
| 5-th percentile | 28.767 |
| Q1 | 95.55 |
| Median | 189.92 |
| Q3 | 303.69 |
| 95-th percentile | 579.39 |
| Maximum | 903.17 |
| Range | 902.57 |
| Interquartile range | 208.14 |
Descriptive statistics
| Standard deviation | 160.27 |
|---|---|
| Coef of variation | 0.7278 |
| Kurtosis | 0.687 |
| Mean | 220.21 |
| MAD | 126.46 |
| Skewness | 1.0284 |
| Sum | 5.9413e+05 |
| Variance | 25686 |
| Memory size | 312.5 KiB |
| Value | Count | Frequency (%) | |
| 629.97 | 15 | 0.1% | |
| 643.03 | 10 | 0.1% | |
| 635.07 | 7 | < 0.1% | |
| 635 | 6 | < 0.1% | |
| 263.63 | 5 | < 0.1% | |
| 188.23 | 4 | < 0.1% | |
| 43.567 | 4 | < 0.1% | |
| 108.13 | 4 | < 0.1% | |
| 385.07 | 4 | < 0.1% | |
| 70.333 | 4 | < 0.1% | |
| Other values (2245) | 2635 | 13.2% | |
| (Missing) | 17302 | 86.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0.6 | 1 | < 0.1% | |
| 1.7667 | 1 | < 0.1% | |
| 1.8 | 1 | < 0.1% | |
| 1.9333 | 1 | < 0.1% | |
| 2.1667 | 2 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 903.17 | 3 | < 0.1% | |
| 820.1 | 1 | < 0.1% | |
| 808.67 | 2 | < 0.1% | |
| 788.6 | 1 | < 0.1% | |
| 721.4 | 1 | < 0.1% |
coligada_mais_antiga_baixada
Numeric
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | > 99.9% |
| Missing (n) | 19990 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 101.51 |
|---|---|
| Minimum | 19.2 |
| Maximum | 190.9 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 19.2 |
|---|---|
| 5-th percentile | 24.975 |
| Q1 | 64.517 |
| Median | 83.967 |
| Q3 | 162.12 |
| 95-th percentile | 190.9 |
| Maximum | 190.9 |
| Range | 171.7 |
| Interquartile range | 97.6 |
Descriptive statistics
| Standard deviation | 65.161 |
|---|---|
| Coef of variation | 0.64192 |
| Kurtosis | -1.2251 |
| Mean | 101.51 |
| MAD | 53.087 |
| Skewness | 0.54341 |
| Sum | 1015.1 |
| Variance | 4246 |
| Memory size | 312.5 KiB |
| Value | Count | Frequency (%) | |
| 83.967 | 4 | < 0.1% | |
| 190.9 | 2 | < 0.1% | |
| 58.033 | 1 | < 0.1% | |
| 32.033 | 1 | < 0.1% | |
| 188.17 | 1 | < 0.1% | |
| 19.2 | 1 | < 0.1% | |
| (Missing) | 19990 | > 99.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 19.2 | 1 | < 0.1% | |
| 32.033 | 1 | < 0.1% | |
| 58.033 | 1 | < 0.1% | |
| 83.967 | 4 | < 0.1% | |
| 188.17 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 190.9 | 2 | < 0.1% | |
| 188.17 | 1 | < 0.1% | |
| 83.967 | 4 | < 0.1% | |
| 58.033 | 1 | < 0.1% | |
| 32.033 | 1 | < 0.1% |
coligada_mais_nova_ativa
Numeric
| Distinct count | 1976 |
|---|---|
| Unique (%) | 9.9% |
| Missing (%) | 86.5% |
| Missing (n) | 17302 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 110.8 |
|---|---|
| Minimum | 0.6 |
| Maximum | 635.43 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 0.6 |
|---|---|
| 5-th percentile | 4.8333 |
| Q1 | 31.742 |
| Median | 79.3 |
| Q3 | 159.56 |
| 95-th percentile | 330.8 |
| Maximum | 635.43 |
| Range | 634.83 |
| Interquartile range | 127.82 |
Descriptive statistics
| Standard deviation | 106.57 |
|---|---|
| Coef of variation | 0.96187 |
| Kurtosis | 2.5365 |
| Mean | 110.8 |
| MAD | 81.511 |
| Skewness | 1.5325 |
| Sum | 2.9893e+05 |
| Variance | 11358 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 470.13 | 14 | 0.1% | |
| 9.2667 | 12 | 0.1% | |
| 1.5333 | 10 | 0.1% | |
| 2.7 | 9 | < 0.1% | |
| 3.8 | 6 | < 0.1% | |
| 78.333 | 6 | < 0.1% | |
| 16.267 | 6 | < 0.1% | |
| 71.467 | 5 | < 0.1% | |
| 27.633 | 5 | < 0.1% | |
| 13.033 | 5 | < 0.1% | |
| Other values (1965) | 2620 | 13.1% | |
| (Missing) | 17302 | 86.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0.6 | 1 | < 0.1% | |
| 1.3 | 1 | < 0.1% | |
| 1.4667 | 2 | < 0.1% | |
| 1.5 | 3 | < 0.1% | |
| 1.5333 | 10 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 635.43 | 1 | < 0.1% | |
| 634.4 | 1 | < 0.1% | |
| 634.13 | 1 | < 0.1% | |
| 604.37 | 1 | < 0.1% | |
| 598.6 | 1 | < 0.1% |
coligada_mais_nova_baixada
Highly correlated
This variable is highly correlated with coligada_mais_antiga_ativa and should be ignored for analysis
| Correlation | 0.94271 |
|---|
de_faixa_faturamento_estimado
Categorical
| Distinct count | 12 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 5.8% |
| Missing (n) | 1154 |
| DE R$ 81.000,01 A R$ 360.000,00 | |
|---|---|
| ATE R$ 81.000,00 | |
| DE R$ 360.000,01 A R$ 1.500.000,00 | 2000 |
| Other values (8) | 686 |
| (Missing) | 1154 |
| Value | Count | Frequency (%) | |
| DE R$ 81.000,01 A R$ 360.000,00 | 11790 | 59.0% | |
| ATE R$ 81.000,00 | 4370 | 21.9% | |
| DE R$ 360.000,01 A R$ 1.500.000,00 | 2000 | 10.0% | |
| DE R$ 1.500.000,01 A R$ 4.800.000,00 | 432 | 2.2% | |
| DE R$ 4.800.000,01 A R$ 10.000.000,00 | 87 | 0.4% | |
| DE R$ 10.000.000,01 A R$ 30.000.000,00 | 85 | 0.4% | |
| SEM INFORMACAO | 39 | 0.2% | |
| DE R$ 30.000.000,01 A R$ 100.000.000,00 | 28 | 0.1% | |
| DE R$ 100.000.000,01 A R$ 300.000.000,00 | 11 | 0.1% | |
| ACIMA DE 1 BILHAO DE REAIS | 2 | < 0.1% | |
| (Missing) | 1154 | 5.8% |
| Max length | 40 |
|---|---|
| Mean length | 26.554 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
de_faixa_faturamento_estimado_grupo
Categorical
| Distinct count | 12 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 5.8% |
| Missing (n) | 1154 |
| DE R$ 81.000,01 A R$ 360.000,00 | |
|---|---|
| ATE R$ 81.000,00 | |
| DE R$ 360.000,01 A R$ 1.500.000,00 | |
| Other values (8) | 1252 |
| (Missing) | 1154 |
| Value | Count | Frequency (%) | |
| DE R$ 81.000,01 A R$ 360.000,00 | 10944 | 54.7% | |
| ATE R$ 81.000,00 | 4343 | 21.7% | |
| DE R$ 360.000,01 A R$ 1.500.000,00 | 2307 | 11.5% | |
| DE R$ 1.500.000,01 A R$ 4.800.000,00 | 562 | 2.8% | |
| DE R$ 4.800.000,01 A R$ 10.000.000,00 | 179 | 0.9% | |
| DE R$ 10.000.000,01 A R$ 30.000.000,00 | 148 | 0.7% | |
| ACIMA DE 1 BILHAO DE REAIS | 137 | 0.7% | |
| DE R$ 30.000.000,01 A R$ 100.000.000,00 | 117 | 0.6% | |
| DE R$ 100.000.000,01 A R$ 300.000.000,00 | 62 | 0.3% | |
| DE R$ 500.000.000,01 A 1 BILHAO DE REAIS | 33 | 0.2% | |
| (Missing) | 1154 | 5.8% |
| Max length | 40 |
|---|---|
| Mean length | 26.781 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
de_indicador_telefone
Constant
This variable is constant and should be ignored for analysis
| Constant value | BOA |
|---|
de_natureza_juridica
Categorical
| Distinct count | 45 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| EMPRESARIO INDIVIDUAL | |
|---|---|
| SOCIEDADE EMPRESARIA LIMITADA | |
| ASSOCIACAO PRIVADA | 1314 |
| Other values (42) | 1897 |
| Value | Count | Frequency (%) | |
| EMPRESARIO INDIVIDUAL | 12904 | 64.5% | |
| SOCIEDADE EMPRESARIA LIMITADA | 3885 | 19.4% | |
| ASSOCIACAO PRIVADA | 1314 | 6.6% | |
| EMPRESA INDIVIDUAL DE RESPONSABILIDADE LIMITADA DE NATUREZA EMPRESARIA | 656 | 3.3% | |
| ORGAO DE DIRECAO LOCAL DE PARTIDO POLITICO | 313 | 1.6% | |
| ORGANIZACAO RELIGIOSA | 104 | 0.5% | |
| CONDOMINIO EDILICIO | 95 | 0.5% | |
| CANDIDATO A CARGO POLITICO ELETIVO | 80 | 0.4% | |
| ENTIDADE SINDICAL | 76 | 0.4% | |
| SOCIEDADE ANONIMA FECHADA | 68 | 0.3% | |
| Other values (35) | 505 | 2.5% |
| Max length | 70 |
|---|---|
| Mean length | 24.474 |
| Min length | 9 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
de_nivel_atividade
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 2.2% |
| Missing (n) | 448 |
| MEDIA | |
|---|---|
| ALTA | |
| BAIXA | |
| (Missing) | 448 |
| Value | Count | Frequency (%) | |
| MEDIA | 9375 | 46.9% | |
| ALTA | 6593 | 33.0% | |
| BAIXA | 3404 | 17.0% | |
| MUITO BAIXA | 180 | 0.9% | |
| (Missing) | 448 | 2.2% |
| Max length | 11 |
|---|---|
| Mean length | 4.6795 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
de_ramo
Categorical
| Distinct count | 33 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| COMERCIO VAREJISTA | |
|---|---|
| SERVICOS DIVERSOS | |
| SERVICOS DE ALOJAMENTO/ALIMENTACAO | 1316 |
| Other values (30) |
| Value | Count | Frequency (%) | |
| COMERCIO VAREJISTA | 7492 | 37.5% | |
| SERVICOS DIVERSOS | 2595 | 13.0% | |
| SERVICOS DE ALOJAMENTO/ALIMENTACAO | 1316 | 6.6% | |
| INDUSTRIA DA CONSTRUCAO | 1132 | 5.7% | |
| COMERCIO E REPARACAO DE VEICULOS | 962 | 4.8% | |
| SERVICOS ADMINISTRATIVOS | 937 | 4.7% | |
| BENS DE CONSUMO | 913 | 4.6% | |
| SERVICOS PROFISSIONAIS, TECNICOS E CIENTIFICOS | 784 | 3.9% | |
| COMERCIO POR ATACADO | 692 | 3.5% | |
| TRANSPORTE, ARMAZENAGEM E CORREIO | 666 | 3.3% | |
| Other values (23) | 2511 | 12.6% |
| Max length | 49 |
|---|---|
| Mean length | 22.115 |
| Min length | 6 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
de_saude_rescencia
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 3.0% |
| Missing (n) | 608 |
| ACIMA DE 1 ANO | |
|---|---|
| ATE 1 ANO | 1665 |
| SEM INFORMACAO | 1297 |
| (Missing) | 608 |
| Value | Count | Frequency (%) | |
| ACIMA DE 1 ANO | 16430 | 82.2% | |
| ATE 1 ANO | 1665 | 8.3% | |
| SEM INFORMACAO | 1297 | 6.5% | |
| (Missing) | 608 | 3.0% |
| Max length | 14 |
|---|---|
| Mean length | 13.249 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
de_saude_tributaria
Categorical
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 3.0% |
| Missing (n) | 608 |
| VERDE | |
|---|---|
| AZUL | |
| AMARELO | |
| Other values (3) |
| Value | Count | Frequency (%) | |
| VERDE | 6379 | 31.9% | |
| AZUL | 4481 | 22.4% | |
| AMARELO | 3951 | 19.8% | |
| CINZA | 2772 | 13.9% | |
| LARANJA | 1579 | 7.9% | |
| VERMELHO | 230 | 1.1% | |
| (Missing) | 608 | 3.0% |
| Max length | 8 |
|---|---|
| Mean length | 5.3026 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
dt_situacao
Categorical
| Distinct count | 4215 |
|---|---|
| Unique (%) | 21.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 2005-11-03 | 2995 |
|---|---|
| 2006-12-21 | 102 |
| 2006-12-01 | 80 |
| Other values (4212) |
| Value | Count | Frequency (%) | |
| 2005-11-03 | 2995 | 15.0% | |
| 2006-12-21 | 102 | 0.5% | |
| 2006-12-01 | 80 | 0.4% | |
| 2005-08-27 | 77 | 0.4% | |
| 2005-09-24 | 68 | 0.3% | |
| 1998-07-28 | 67 | 0.3% | |
| 2010-05-15 | 65 | 0.3% | |
| 2006-12-02 | 51 | 0.3% | |
| 2004-10-30 | 43 | 0.2% | |
| 2004-10-16 | 36 | 0.2% | |
| Other values (4205) | 16416 | 82.1% |
| Max length | 10 |
|---|---|
| Mean length | 10 |
| Min length | 10 |
| Contains chars | False |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
empsetorcensitariofaixarendapopulacao
Numeric
| Distinct count | 6857 |
|---|---|
| Unique (%) | 34.3% |
| Missing (%) | 31.3% |
| Missing (n) | 6268 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1302.2 |
|---|---|
| Minimum | 110.3 |
| Maximum | 30862 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 110.3 |
|---|---|
| 5-th percentile | 422.84 |
| Q1 | 668.88 |
| Median | 946.33 |
| Q3 | 1500 |
| 95-th percentile | 3475 |
| Maximum | 30862 |
| Range | 30752 |
| Interquartile range | 831.12 |
Descriptive statistics
| Standard deviation | 1112.4 |
|---|---|
| Coef of variation | 0.85419 |
| Kurtosis | 51.235 |
| Mean | 1302.2 |
| MAD | 715.32 |
| Skewness | 4.2975 |
| Sum | 1.7882e+07 |
| Variance | 1.2374e+06 |
| Memory size | 312.5 KiB |
| Value | Count | Frequency (%) | |
| 1549.1 | 101 | 0.5% | |
| 2361.4 | 39 | 0.2% | |
| 1086 | 33 | 0.2% | |
| 3072 | 31 | 0.2% | |
| 3940.1 | 28 | 0.1% | |
| 786.74 | 23 | 0.1% | |
| 2019.4 | 22 | 0.1% | |
| 1699.2 | 19 | 0.1% | |
| 1910.8 | 19 | 0.1% | |
| 2679.5 | 18 | 0.1% | |
| Other values (6846) | 13399 | 67.0% | |
| (Missing) | 6268 | 31.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 110.3 | 1 | < 0.1% | |
| 169.28 | 2 | < 0.1% | |
| 180 | 1 | < 0.1% | |
| 182.94 | 2 | < 0.1% | |
| 183.71 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 30862 | 1 | < 0.1% | |
| 13793 | 3 | < 0.1% | |
| 12513 | 5 | < 0.1% | |
| 10469 | 4 | < 0.1% | |
| 10039 | 7 | < 0.1% |
faturamento_est_coligados
Highly correlated
This variable is highly correlated with coligada_mais_nova_baixada and should be ignored for analysis
| Correlation | 0.91204 |
|---|
faturamento_est_coligados_gp
Highly correlated
This variable is highly correlated with faturamento_est_coligados and should be ignored for analysis
| Correlation | 0.99793 |
|---|
fl_antt
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.4% |
| Missing (n) | 74 |
| False | |
|---|---|
| True | 133 |
| (Missing) | 74 |
| Value | Count | Frequency (%) | |
| False | 19793 | 99.0% | |
| True | 133 | 0.7% | |
| (Missing) | 74 | 0.4% |
fl_email
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) | |
| False | 11044 | 55.2% | |
| True | 8956 | 44.8% |
fl_epp
Constant
This variable is constant and should be ignored for analysis
| Constant value | False |
|---|
fl_ltda
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True | 52 |
| Value | Count | Frequency (%) | |
| False | 19948 | 99.7% | |
| True | 52 | 0.3% |
fl_matriz
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| True | |
|---|---|
| False | 1157 |
| Value | Count | Frequency (%) | |
| True | 18843 | 94.2% | |
| False | 1157 | 5.8% |
fl_me
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True | 54 |
| Value | Count | Frequency (%) | |
| False | 19946 | 99.7% | |
| True | 54 | 0.3% |
fl_mei
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) | |
| False | 13405 | 67.0% | |
| True | 6595 | 33.0% |
fl_optante_simei
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 17.7% |
| Missing (n) | 3538 |
| False | |
|---|---|
| True | |
| (Missing) |
| Value | Count | Frequency (%) | |
| False | 12291 | 61.5% | |
| True | 4171 | 20.9% | |
| (Missing) | 3538 | 17.7% |
fl_optante_simples
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 17.7% |
| Missing (n) | 3538 |
| True | |
|---|---|
| False | |
| (Missing) |
| Value | Count | Frequency (%) | |
| True | 8843 | 44.2% | |
| False | 7619 | 38.1% | |
| (Missing) | 3538 | 17.7% |
fl_passivel_iss
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.4% |
| Missing (n) | 74 |
| True | |
|---|---|
| False | |
| (Missing) | 74 |
| Value | Count | Frequency (%) | |
| True | 11477 | 57.4% | |
| False | 8449 | 42.2% | |
| (Missing) | 74 | 0.4% |
fl_rm
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| NAO | |
|---|---|
| SIM |
| Value | Count | Frequency (%) | |
| NAO | 10288 | 51.4% | |
| SIM | 9712 | 48.6% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
fl_sa
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True | 333 |
| Value | Count | Frequency (%) | |
| False | 19667 | 98.3% | |
| True | 333 | 1.7% |
fl_simples_irregular
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.4% |
| Missing (n) | 74 |
| False | |
|---|---|
| True | 18 |
| (Missing) | 74 |
| Value | Count | Frequency (%) | |
| False | 19908 | 99.5% | |
| True | 18 | 0.1% | |
| (Missing) | 74 | 0.4% |
fl_spa
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.4% |
| Missing (n) | 74 |
| False | |
|---|---|
| True | 10 |
| (Missing) | 74 |
| Value | Count | Frequency (%) | |
| False | 19916 | 99.6% | |
| True | 10 | 0.1% | |
| (Missing) | 74 | 0.4% |
fl_st_especial
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True | 6 |
| Value | Count | Frequency (%) | |
| False | 19994 | > 99.9% | |
| True | 6 | < 0.1% |
fl_telefone
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) | |
| True | 14602 | 73.0% | |
| False | 5398 | 27.0% |
fl_veiculo
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.4% |
| Missing (n) | 74 |
| False | |
|---|---|
| True | 1319 |
| (Missing) | 74 |
| Value | Count | Frequency (%) | |
| False | 18607 | 93.0% | |
| True | 1319 | 6.6% | |
| (Missing) | 74 | 0.4% |
grau_instrucao_macro_analfabeto
Numeric
| Distinct count | 10 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 99.7% |
| Missing (n) | 19930 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.4571 |
|---|---|
| Minimum | 1 |
| Maximum | 41 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 7.55 |
| Maximum | 41 |
| Range | 40 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 5.2022 |
|---|---|
| Coef of variation | 2.1172 |
| Kurtosis | 45.098 |
| Mean | 2.4571 |
| MAD | 2.1135 |
| Skewness | 6.3353 |
| Sum | 172 |
| Variance | 27.063 |
| Memory size | 312.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 47 | 0.2% | |
| 2 | 12 | 0.1% | |
| 3 | 5 | < 0.1% | |
| 7 | 1 | < 0.1% | |
| 14 | 1 | < 0.1% | |
| 41 | 1 | < 0.1% | |
| 12 | 1 | < 0.1% | |
| 8 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| (Missing) | 19930 | 99.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 47 | 0.2% | |
| 2 | 12 | 0.1% | |
| 3 | 5 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 7 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 41 | 1 | < 0.1% | |
| 14 | 1 | < 0.1% | |
| 12 | 1 | < 0.1% | |
| 8 | 1 | < 0.1% | |
| 7 | 1 | < 0.1% |
grau_instrucao_macro_desconhecido
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
grau_instrucao_macro_escolaridade_fundamental
Numeric
| Distinct count | 68 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 94.5% |
| Missing (n) | 18909 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 8.4308 |
|---|---|
| Minimum | 1 |
| Maximum | 971 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 2 |
| Q3 | 4 |
| 95-th percentile | 23.5 |
| Maximum | 971 |
| Range | 970 |
| Interquartile range | 3 |
Descriptive statistics
| Standard deviation | 43.108 |
|---|---|
| Coef of variation | 5.1131 |
| Kurtosis | 279.31 |
| Mean | 8.4308 |
| MAD | 11.054 |
| Skewness | 15.003 |
| Sum | 9198 |
| Variance | 1858.3 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 511 | 2.6% | |
| 2 | 180 | 0.9% | |
| 3 | 104 | 0.5% | |
| 4 | 63 | 0.3% | |
| 5 | 36 | 0.2% | |
| 6 | 28 | 0.1% | |
| 7 | 24 | 0.1% | |
| 8 | 13 | 0.1% | |
| 11 | 10 | 0.1% | |
| 10 | 10 | 0.1% | |
| Other values (57) | 112 | 0.6% | |
| (Missing) | 18909 | 94.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 511 | 2.6% | |
| 2 | 180 | 0.9% | |
| 3 | 104 | 0.5% | |
| 4 | 63 | 0.3% | |
| 5 | 36 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 971 | 1 | < 0.1% | |
| 586 | 1 | < 0.1% | |
| 471 | 1 | < 0.1% | |
| 437 | 1 | < 0.1% | |
| 233 | 1 | < 0.1% |
grau_instrucao_macro_escolaridade_media
Numeric
| Distinct count | 108 |
|---|---|
| Unique (%) | 0.5% |
| Missing (%) | 85.3% |
| Missing (n) | 17060 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 10.011 |
|---|---|
| Minimum | 1 |
| Maximum | 2387 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 3 |
| Q3 | 6 |
| 95-th percentile | 28 |
| Maximum | 2387 |
| Range | 2386 |
| Interquartile range | 5 |
Descriptive statistics
| Standard deviation | 61.126 |
|---|---|
| Coef of variation | 6.1058 |
| Kurtosis | 966.51 |
| Mean | 10.011 |
| MAD | 11.908 |
| Skewness | 27.975 |
| Sum | 29433 |
| Variance | 3736.4 |
| Memory size | 312.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 932 | 4.7% | |
| 2 | 487 | 2.4% | |
| 3 | 308 | 1.5% | |
| 4 | 215 | 1.1% | |
| 5 | 148 | 0.7% | |
| 6 | 119 | 0.6% | |
| 7 | 110 | 0.5% | |
| 8 | 69 | 0.3% | |
| 9 | 63 | 0.3% | |
| 10 | 49 | 0.2% | |
| Other values (97) | 440 | 2.2% | |
| (Missing) | 17060 | 85.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 932 | 4.7% | |
| 2 | 487 | 2.4% | |
| 3 | 308 | 1.5% | |
| 4 | 215 | 1.1% | |
| 5 | 148 | 0.7% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 2387 | 1 | < 0.1% | |
| 1662 | 1 | < 0.1% | |
| 523 | 1 | < 0.1% | |
| 473 | 1 | < 0.1% | |
| 471 | 1 | < 0.1% |
grau_instrucao_macro_escolaridade_superior
Numeric
| Distinct count | 71 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 95.3% |
| Missing (n) | 19058 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 16.412 |
|---|---|
| Minimum | 1 |
| Maximum | 2652 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 2 |
| Q3 | 4 |
| 95-th percentile | 32.9 |
| Maximum | 2652 |
| Range | 2651 |
| Interquartile range | 3 |
Descriptive statistics
| Standard deviation | 128.7 |
|---|---|
| Coef of variation | 7.8419 |
| Kurtosis | 268.31 |
| Mean | 16.412 |
| MAD | 25.271 |
| Skewness | 15.476 |
| Sum | 15460 |
| Variance | 16564 |
| Memory size | 312.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 468 | 2.3% | |
| 2 | 155 | 0.8% | |
| 3 | 71 | 0.4% | |
| 4 | 46 | 0.2% | |
| 6 | 25 | 0.1% | |
| 5 | 19 | 0.1% | |
| 7 | 15 | 0.1% | |
| 9 | 12 | 0.1% | |
| 8 | 11 | 0.1% | |
| 12 | 8 | < 0.1% | |
| Other values (60) | 112 | 0.6% | |
| (Missing) | 19058 | 95.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 468 | 2.3% | |
| 2 | 155 | 0.8% | |
| 3 | 71 | 0.4% | |
| 4 | 46 | 0.2% | |
| 5 | 19 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 2652 | 1 | < 0.1% | |
| 1820 | 1 | < 0.1% | |
| 1794 | 1 | < 0.1% | |
| 836 | 1 | < 0.1% | |
| 624 | 1 | < 0.1% |
id
Categorical, Unique
| First 5 values |
|---|
| 000771209cccfe8b7819f2b4268eed64e0bc791f42ac9b54fe8780bcdc3b1c59 |
| 00100f97371ee970819d79d123008a51a657bd953c9f5f10a8d6bf2a76cc16cb |
| 00123b6e449556823ba4aac6dbb35b44f60557c511566f838dc889b75c6f9af1 |
| 0013cb64d383b0a8279866a69d4145500ae3e20fd848c90f6f828db4c72cc0f4 |
| 001555421ec37ae3440e5a070cc78f36f74146df2d4b1989c80f4ea404cbd028 |
| Last 5 values |
|---|
| ffec2f6b21fff2a8379e133339634e6a39240dc55c956c17d1c03bb203141966 |
| ffed1e47eaf7b3444605cd7cb91bf9ef7cf3bbe9f7f73092c10d21a1d454d1fd |
| fff06822a7e843723dbe870e65ac0a447082fb467a657e13f466c2cf4c9bfef9 |
| fff5a3249f08ad99ffd8b13f3f29c366619b06c9d479c5a9e5115277b83ec11a |
| fff5af950eefc667f4b9648f497eae11b53b92ecbe5e508724373e5f8f2eec34 |
First 5 values
| Value | Count | Frequency (%) | |
| 000771209cccfe8b7819f2b4268eed64e0bc791f42ac9b54fe8780bcdc3b1c59 | 1 | < 0.1% | |
| 00100f97371ee970819d79d123008a51a657bd953c9f5f10a8d6bf2a76cc16cb | 1 | < 0.1% | |
| 00123b6e449556823ba4aac6dbb35b44f60557c511566f838dc889b75c6f9af1 | 1 | < 0.1% | |
| 0013cb64d383b0a8279866a69d4145500ae3e20fd848c90f6f828db4c72cc0f4 | 1 | < 0.1% | |
| 001555421ec37ae3440e5a070cc78f36f74146df2d4b1989c80f4ea404cbd028 | 1 | < 0.1% |
Last 5 values
| Value | Count | Frequency (%) | |
| fff5af950eefc667f4b9648f497eae11b53b92ecbe5e508724373e5f8f2eec34 | 1 | < 0.1% | |
| fff5a3249f08ad99ffd8b13f3f29c366619b06c9d479c5a9e5115277b83ec11a | 1 | < 0.1% | |
| fff06822a7e843723dbe870e65ac0a447082fb467a657e13f466c2cf4c9bfef9 | 1 | < 0.1% | |
| ffed1e47eaf7b3444605cd7cb91bf9ef7cf3bbe9f7f73092c10d21a1d454d1fd | 1 | < 0.1% | |
| ffec2f6b21fff2a8379e133339634e6a39240dc55c956c17d1c03bb203141966 | 1 | < 0.1% |
idade_acima_de_58
Numeric
| Distinct count | 34 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 97.6% |
| Missing (n) | 19524 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 11.931 |
|---|---|
| Minimum | 1 |
| Maximum | 1844 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 18 |
| Maximum | 1844 |
| Range | 1843 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 107.42 |
|---|---|
| Coef of variation | 9.0033 |
| Kurtosis | 237.1 |
| Mean | 11.931 |
| MAD | 18.993 |
| Skewness | 15.1 |
| Sum | 5679 |
| Variance | 11538 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 295 | 1.5% | |
| 2 | 79 | 0.4% | |
| 3 | 31 | 0.2% | |
| 4 | 12 | 0.1% | |
| 6 | 10 | 0.1% | |
| 5 | 8 | < 0.1% | |
| 8 | 4 | < 0.1% | |
| 18 | 4 | < 0.1% | |
| 24 | 3 | < 0.1% | |
| 45 | 3 | < 0.1% | |
| Other values (23) | 27 | 0.1% | |
| (Missing) | 19524 | 97.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 295 | 1.5% | |
| 2 | 79 | 0.4% | |
| 3 | 31 | 0.2% | |
| 4 | 12 | 0.1% | |
| 5 | 8 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1844 | 1 | < 0.1% | |
| 1401 | 1 | < 0.1% | |
| 216 | 1 | < 0.1% | |
| 212 | 1 | < 0.1% | |
| 143 | 1 | < 0.1% |
idade_ate_18
Numeric
| Distinct count | 8 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 99.4% |
| Missing (n) | 19888 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.6964 |
|---|---|
| Minimum | 1 |
| Maximum | 8 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 5.45 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 1.4694 |
|---|---|
| Coef of variation | 0.8662 |
| Kurtosis | 6.8426 |
| Mean | 1.6964 |
| MAD | 0.9949 |
| Skewness | 2.6233 |
| Sum | 190 |
| Variance | 2.1593 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 80 | 0.4% | |
| 2 | 14 | 0.1% | |
| 3 | 8 | < 0.1% | |
| 6 | 4 | < 0.1% | |
| 5 | 2 | < 0.1% | |
| 8 | 2 | < 0.1% | |
| 4 | 2 | < 0.1% | |
| (Missing) | 19888 | 99.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 80 | 0.4% | |
| 2 | 14 | 0.1% | |
| 3 | 8 | < 0.1% | |
| 4 | 2 | < 0.1% | |
| 5 | 2 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 8 | 2 | < 0.1% | |
| 6 | 4 | < 0.1% | |
| 5 | 2 | < 0.1% | |
| 4 | 2 | < 0.1% | |
| 3 | 8 | < 0.1% |
idade_de_19_a_23
Numeric
| Distinct count | 29 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 94.3% |
| Missing (n) | 18870 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.8584 |
|---|---|
| Minimum | 1 |
| Maximum | 50 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 3 |
| 95-th percentile | 9 |
| Maximum | 50 |
| Range | 49 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 4.5512 |
|---|---|
| Coef of variation | 1.5922 |
| Kurtosis | 39.138 |
| Mean | 2.8584 |
| MAD | 2.2592 |
| Skewness | 5.5398 |
| Sum | 3230 |
| Variance | 20.713 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 576 | 2.9% | |
| 2 | 240 | 1.2% | |
| 3 | 108 | 0.5% | |
| 4 | 51 | 0.3% | |
| 5 | 45 | 0.2% | |
| 6 | 19 | 0.1% | |
| 7 | 19 | 0.1% | |
| 9 | 12 | 0.1% | |
| 8 | 11 | 0.1% | |
| 10 | 9 | < 0.1% | |
| Other values (18) | 40 | 0.2% | |
| (Missing) | 18870 | 94.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 576 | 2.9% | |
| 2 | 240 | 1.2% | |
| 3 | 108 | 0.5% | |
| 4 | 51 | 0.3% | |
| 5 | 45 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 50 | 1 | < 0.1% | |
| 49 | 1 | < 0.1% | |
| 43 | 1 | < 0.1% | |
| 36 | 1 | < 0.1% | |
| 34 | 2 | < 0.1% |
idade_de_24_a_28
Numeric
| Distinct count | 51 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 91.8% |
| Missing (n) | 18368 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 3.8444 |
|---|---|
| Minimum | 1 |
| Maximum | 144 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 2 |
| Q3 | 3 |
| 95-th percentile | 12 |
| Maximum | 144 |
| Range | 143 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 9.001 |
|---|---|
| Coef of variation | 2.3414 |
| Kurtosis | 108.48 |
| Mean | 3.8444 |
| MAD | 3.5915 |
| Skewness | 9.0596 |
| Sum | 6274 |
| Variance | 81.018 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 783 | 3.9% | |
| 2 | 305 | 1.5% | |
| 3 | 167 | 0.8% | |
| 4 | 88 | 0.4% | |
| 5 | 59 | 0.3% | |
| 6 | 42 | 0.2% | |
| 8 | 29 | 0.1% | |
| 7 | 28 | 0.1% | |
| 9 | 18 | 0.1% | |
| 10 | 15 | 0.1% | |
| Other values (40) | 98 | 0.5% | |
| (Missing) | 18368 | 91.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 783 | 3.9% | |
| 2 | 305 | 1.5% | |
| 3 | 167 | 0.8% | |
| 4 | 88 | 0.4% | |
| 5 | 59 | 0.3% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 144 | 1 | < 0.1% | |
| 133 | 1 | < 0.1% | |
| 128 | 1 | < 0.1% | |
| 110 | 1 | < 0.1% | |
| 90 | 1 | < 0.1% |
idade_de_29_a_33
Numeric
| Distinct count | 59 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 91.5% |
| Missing (n) | 18292 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 4.9327 |
|---|---|
| Minimum | 1 |
| Maximum | 411 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 2 |
| Q3 | 4 |
| 95-th percentile | 15 |
| Maximum | 411 |
| Range | 410 |
| Interquartile range | 3 |
Descriptive statistics
| Standard deviation | 18.928 |
|---|---|
| Coef of variation | 3.8373 |
| Kurtosis | 268.81 |
| Mean | 4.9327 |
| MAD | 5.2876 |
| Skewness | 14.618 |
| Sum | 8425 |
| Variance | 358.27 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 832 | 4.2% | |
| 2 | 293 | 1.5% | |
| 3 | 154 | 0.8% | |
| 4 | 93 | 0.5% | |
| 5 | 80 | 0.4% | |
| 8 | 37 | 0.2% | |
| 6 | 34 | 0.2% | |
| 7 | 32 | 0.2% | |
| 9 | 21 | 0.1% | |
| 10 | 11 | 0.1% | |
| Other values (48) | 121 | 0.6% | |
| (Missing) | 18292 | 91.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 832 | 4.2% | |
| 2 | 293 | 1.5% | |
| 3 | 154 | 0.8% | |
| 4 | 93 | 0.5% | |
| 5 | 80 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 411 | 1 | < 0.1% | |
| 405 | 1 | < 0.1% | |
| 222 | 1 | < 0.1% | |
| 213 | 1 | < 0.1% | |
| 182 | 1 | < 0.1% |
idade_de_34_a_38
Highly correlated
This variable is highly correlated with idade_de_29_a_33 and should be ignored for analysis
| Correlation | 0.96376 |
|---|
idade_de_39_a_43
Highly correlated
This variable is highly correlated with idade_de_34_a_38 and should be ignored for analysis
| Correlation | 0.95304 |
|---|
idade_de_44_a_48
Highly correlated
This variable is highly correlated with idade_de_39_a_43 and should be ignored for analysis
| Correlation | 0.92955 |
|---|
idade_de_49_a_53
Highly correlated
This variable is highly correlated with idade_de_44_a_48 and should be ignored for analysis
| Correlation | 0.9679 |
|---|
idade_de_54_a_58
Highly correlated
This variable is highly correlated with idade_de_49_a_53 and should be ignored for analysis
| Correlation | 0.94691 |
|---|
idade_emp_cat
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 1 a 5 | |
|---|---|
| 5 a 10 | |
| > 20 | |
| Other values (3) |
| Value | Count | Frequency (%) | |
| 1 a 5 | 6103 | 30.5% | |
| 5 a 10 | 5043 | 25.2% | |
| > 20 | 3261 | 16.3% | |
| 10 a 15 | 2045 | 10.2% | |
| <= 1 | 1982 | 9.9% | |
| 15 a 20 | 1566 | 7.8% |
| Max length | 7 |
|---|---|
| Mean length | 5.3511 |
| Min length | 4 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
idade_empresa_anos
Numeric
| Distinct count | 6971 |
|---|---|
| Unique (%) | 34.9% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 9.8907 |
|---|---|
| Minimum | 0.027397 |
| Maximum | 55.833 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 0.027397 |
|---|---|
| 5-th percentile | 0.48767 |
| Q1 | 2.7315 |
| Median | 6.6301 |
| Q3 | 14.43 |
| 95-th percentile | 29.875 |
| Maximum | 55.833 |
| Range | 55.805 |
| Interquartile range | 11.699 |
Descriptive statistics
| Standard deviation | 9.5795 |
|---|---|
| Coef of variation | 0.96854 |
| Kurtosis | 1.4746 |
| Mean | 9.8907 |
| MAD | 7.5309 |
| Skewness | 1.375 |
| Sum | 1.9781e+05 |
| Variance | 91.767 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0.20822 | 28 | 0.1% | |
| 0.20548 | 28 | 0.1% | |
| 0.74247 | 23 | 0.1% | |
| 8.0219 | 18 | 0.1% | |
| 0.24384 | 17 | 0.1% | |
| 0.21644 | 17 | 0.1% | |
| 0.20274 | 17 | 0.1% | |
| 0.65753 | 16 | 0.1% | |
| 0.45753 | 16 | 0.1% | |
| 0.55068 | 15 | 0.1% | |
| Other values (6961) | 19805 | 99.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0.027397 | 2 | < 0.1% | |
| 0.030137 | 8 | < 0.1% | |
| 0.032877 | 8 | < 0.1% | |
| 0.035616 | 3 | < 0.1% | |
| 0.038356 | 2 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 55.833 | 1 | < 0.1% | |
| 52.942 | 1 | < 0.1% | |
| 52.277 | 2 | < 0.1% | |
| 52.236 | 1 | < 0.1% | |
| 52.2 | 1 | < 0.1% |
idade_maxima_coligadas
Highly correlated
This variable is highly correlated with coligada_mais_nova_baixada and should be ignored for analysis
| Correlation | 0.94271 |
|---|
idade_maxima_socios
Numeric
| Distinct count | 88 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 32.9% |
| Missing (n) | 6586 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 44.23 |
|---|---|
| Minimum | 8 |
| Maximum | 118 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 8 |
|---|---|
| 5-th percentile | 24 |
| Q1 | 34 |
| Median | 43 |
| Q3 | 54 |
| 95-th percentile | 69 |
| Maximum | 118 |
| Range | 110 |
| Interquartile range | 20 |
Descriptive statistics
| Standard deviation | 13.935 |
|---|---|
| Coef of variation | 0.31505 |
| Kurtosis | 0.040116 |
| Mean | 44.23 |
| MAD | 11.338 |
| Skewness | 0.55591 |
| Sum | 5.9330e+05 |
| Variance | 194.18 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 37 | 413 | 2.1% | |
| 36 | 386 | 1.9% | |
| 35 | 385 | 1.9% | |
| 41 | 385 | 1.9% | |
| 39 | 377 | 1.9% | |
| 40 | 372 | 1.9% | |
| 44 | 370 | 1.8% | |
| 33 | 362 | 1.8% | |
| 34 | 355 | 1.8% | |
| 38 | 344 | 1.7% | |
| Other values (77) | 9665 | 48.3% | |
| (Missing) | 6586 | 32.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 8 | 1 | < 0.1% | |
| 13 | 1 | < 0.1% | |
| 16 | 1 | < 0.1% | |
| 17 | 1 | < 0.1% | |
| 18 | 19 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 118 | 1 | < 0.1% | |
| 110 | 1 | < 0.1% | |
| 106 | 1 | < 0.1% | |
| 104 | 2 | < 0.1% | |
| 97 | 2 | < 0.1% |
idade_media_coligadas
Highly correlated
This variable is highly correlated with coligada_mais_nova_baixada and should be ignored for analysis
| Correlation | 0.97597 |
|---|
idade_media_coligadas_ativas
Highly correlated
This variable is highly correlated with idade_media_coligadas and should be ignored for analysis
| Correlation | 0.99868 |
|---|
idade_media_coligadas_baixadas
Highly correlated
This variable is highly correlated with idade_media_coligadas_ativas and should be ignored for analysis
| Correlation | 0.92832 |
|---|
idade_media_socios
Highly correlated
This variable is highly correlated with idade_maxima_socios and should be ignored for analysis
| Correlation | 0.95975 |
|---|
idade_minima_coligadas
Highly correlated
This variable is highly correlated with coligada_mais_nova_ativa and should be ignored for analysis
| Correlation | 0.99882 |
|---|
idade_minima_socios
Highly correlated
This variable is highly correlated with idade_media_socios and should be ignored for analysis
| Correlation | 0.95683 |
|---|
max_faturamento_est_coligados
Highly correlated
This variable is highly correlated with faturamento_est_coligados_gp and should be ignored for analysis
| Correlation | 0.94776 |
|---|
max_faturamento_est_coligados_gp
Highly correlated
This variable is highly correlated with max_faturamento_est_coligados and should be ignored for analysis
| Correlation | 0.99411 |
|---|
max_filiais_coligados
Highly correlated
This variable is highly correlated with coligada_mais_nova_baixada and should be ignored for analysis
| Correlation | 0.97946 |
|---|
max_funcionarios_coligados_gp
Highly correlated
This variable is highly correlated with coligada_mais_nova_baixada and should be ignored for analysis
| Correlation | 0.9016 |
|---|
max_meses_servicos
Highly correlated
This variable is highly correlated with idade_media_coligadas_baixadas and should be ignored for analysis
| Correlation | 0.97658 |
|---|
max_meses_servicos_all
Numeric
| Distinct count | 1619 |
|---|---|
| Unique (%) | 8.1% |
| Missing (%) | 77.6% |
| Missing (n) | 15515 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 168.05 |
|---|---|
| Minimum | -0.63333 |
| Maximum | 5099.1 |
| Zeros (%) | < 0.1% |
Quantile statistics
| Minimum | -0.63333 |
|---|---|
| 5-th percentile | 5.9667 |
| Q1 | 25.333 |
| Median | 54.667 |
| Q3 | 95.167 |
| 95-th percentile | 250.11 |
| Maximum | 5099.1 |
| Range | 5099.7 |
| Interquartile range | 69.833 |
Descriptive statistics
| Standard deviation | 682.64 |
|---|---|
| Coef of variation | 4.0621 |
| Kurtosis | 46.676 |
| Mean | 168.05 |
| MAD | 200.67 |
| Skewness | 6.9344 |
| Sum | 7.537e+05 |
| Variance | 4.6599e+05 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 93.267 | 33 | 0.2% | |
| 5015 | 28 | 0.1% | |
| 67.867 | 27 | 0.1% | |
| 5064.7 | 26 | 0.1% | |
| 40.5 | 26 | 0.1% | |
| 63.8 | 25 | 0.1% | |
| 88.167 | 25 | 0.1% | |
| 92.233 | 24 | 0.1% | |
| 28.3 | 24 | 0.1% | |
| 33.333 | 23 | 0.1% | |
| Other values (1608) | 4224 | 21.1% | |
| (Missing) | 15515 | 77.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| -0.63333 | 1 | < 0.1% | |
| -0.066667 | 1 | < 0.1% | |
| 0 | 5 | < 0.1% | |
| 0.13333 | 1 | < 0.1% | |
| 0.23333 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 5099.1 | 12 | 0.1% | |
| 5066.7 | 7 | < 0.1% | |
| 5065.7 | 4 | < 0.1% | |
| 5064.7 | 26 | 0.1% | |
| 5039.3 | 3 | < 0.1% |
max_vl_folha_coligados
Highly correlated
This variable is highly correlated with max_filiais_coligados and should be ignored for analysis
| Correlation | 0.91799 |
|---|
max_vl_folha_coligados_gp
Highly correlated
This variable is highly correlated with max_vl_folha_coligados and should be ignored for analysis
| Correlation | 0.9144 |
|---|
media_faturamento_est_coligados
Numeric
| Distinct count | 962 |
|---|---|
| Unique (%) | 4.8% |
| Missing (%) | 86.7% |
| Missing (n) | 17332 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.371e+07 |
|---|---|
| Minimum | 41213 |
| Maximum | 6.1297e+09 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 41213 |
|---|---|
| 5-th percentile | 1.6682e+05 |
| Q1 | 2.1e+05 |
| Median | 3.6103e+05 |
| Q3 | 1.1127e+06 |
| 95-th percentile | 1.8733e+07 |
| Maximum | 6.1297e+09 |
| Range | 6.1297e+09 |
| Interquartile range | 9.0275e+05 |
Descriptive statistics
| Standard deviation | 2.3796e+08 |
|---|---|
| Coef of variation | 10.036 |
| Kurtosis | 421.67 |
| Mean | 2.371e+07 |
| MAD | 4.2864e+07 |
| Skewness | 18.882 |
| Sum | 6.3258e+10 |
| Variance | 5.6625e+16 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 2.1e+05 | 939 | 4.7% | |
| 9.3e+05 | 100 | 0.5% | |
| 3.7092e+05 | 46 | 0.2% | |
| 50000 | 44 | 0.2% | |
| 1.8546e+05 | 35 | 0.2% | |
| 5.7e+05 | 29 | 0.1% | |
| 1.2364e+05 | 26 | 0.1% | |
| 5.5637e+05 | 20 | 0.1% | |
| 7.4183e+05 | 20 | 0.1% | |
| 1.6682e+05 | 20 | 0.1% | |
| Other values (951) | 1389 | 6.9% | |
| (Missing) | 17332 | 86.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 41213 | 6 | < 0.1% | |
| 50000 | 44 | 0.2% | |
| 51516 | 1 | < 0.1% | |
| 61819 | 3 | < 0.1% | |
| 82426 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 6.1297e+09 | 1 | < 0.1% | |
| 5.6516e+09 | 2 | < 0.1% | |
| 3.4331e+09 | 1 | < 0.1% | |
| 2.274e+09 | 1 | < 0.1% | |
| 2.0492e+09 | 2 | < 0.1% |
media_faturamento_est_coligados_gp
Highly correlated
This variable is highly correlated with media_faturamento_est_coligados and should be ignored for analysis
| Correlation | 0.98196 |
|---|
media_filiais_coligados
Highly correlated
This variable is highly correlated with coligada_mais_nova_baixada and should be ignored for analysis
| Correlation | 0.98273 |
|---|
media_funcionarios_coligados_gp
Highly correlated
This variable is highly correlated with coligada_mais_nova_baixada and should be ignored for analysis
| Correlation | 0.93916 |
|---|
media_meses_servicos
Highly correlated
This variable is highly correlated with idade_media_coligadas_baixadas and should be ignored for analysis
| Correlation | 0.94844 |
|---|
media_meses_servicos_all
Numeric
| Distinct count | 3801 |
|---|---|
| Unique (%) | 19.0% |
| Missing (%) | 77.6% |
| Missing (n) | 15515 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 43.279 |
|---|---|
| Minimum | -0.63333 |
| Maximum | 5099.1 |
| Zeros (%) | < 0.1% |
Quantile statistics
| Minimum | -0.63333 |
|---|---|
| 5-th percentile | 4.6361 |
| Q1 | 13.825 |
| Median | 23.944 |
| Q3 | 40.806 |
| 95-th percentile | 103.88 |
| Maximum | 5099.1 |
| Range | 5099.7 |
| Interquartile range | 26.981 |
Descriptive statistics
| Standard deviation | 158.25 |
|---|---|
| Coef of variation | 3.6564 |
| Kurtosis | 695.6 |
| Mean | 43.279 |
| MAD | 35.851 |
| Skewness | 23.838 |
| Sum | 1.9411e+05 |
| Variance | 25041 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 2.9333 | 15 | 0.1% | |
| 11.133 | 12 | 0.1% | |
| 1 | 11 | 0.1% | |
| 5.1 | 11 | 0.1% | |
| 4.0667 | 11 | 0.1% | |
| 14.2 | 10 | 0.1% | |
| 8.0333 | 10 | 0.1% | |
| 2.0333 | 10 | 0.1% | |
| 39.467 | 9 | < 0.1% | |
| 8.9667 | 9 | < 0.1% | |
| Other values (3790) | 4377 | 21.9% | |
| (Missing) | 15515 | 77.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| -0.63333 | 1 | < 0.1% | |
| -0.066667 | 1 | < 0.1% | |
| 0 | 5 | < 0.1% | |
| 0.13333 | 1 | < 0.1% | |
| 0.23333 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 5099.1 | 1 | < 0.1% | |
| 5015 | 2 | < 0.1% | |
| 2539.9 | 1 | < 0.1% | |
| 1785.9 | 1 | < 0.1% | |
| 1734.3 | 1 | < 0.1% |
media_vl_folha_coligados
Numeric
| Distinct count | 695 |
|---|---|
| Unique (%) | 3.5% |
| Missing (%) | 92.4% |
| Missing (n) | 18485 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 5.3355e+06 |
|---|---|
| Minimum | 0 |
| Maximum | 5.2296e+08 |
| Zeros (%) | < 0.1% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 61819 |
| Q1 | 1.8546e+05 |
| Median | 4.7689e+05 |
| Q3 | 1.6691e+06 |
| 95-th percentile | 1.9745e+07 |
| Maximum | 5.2296e+08 |
| Range | 5.2296e+08 |
| Interquartile range | 1.4837e+06 |
Descriptive statistics
| Standard deviation | 2.4588e+07 |
|---|---|
| Coef of variation | 4.6084 |
| Kurtosis | 175.47 |
| Mean | 5.3355e+06 |
| MAD | 7.9618e+06 |
| Skewness | 11.237 |
| Sum | 8.0833e+09 |
| Variance | 6.0458e+14 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 61819 | 126 | 0.6% | |
| 1.2364e+05 | 73 | 0.4% | |
| 1.8546e+05 | 61 | 0.3% | |
| 2.4728e+05 | 37 | 0.2% | |
| 20606 | 34 | 0.2% | |
| 3.091e+05 | 29 | 0.1% | |
| 4.3273e+05 | 27 | 0.1% | |
| 3.7092e+05 | 22 | 0.1% | |
| 4.9455e+05 | 21 | 0.1% | |
| 2.0606e+05 | 21 | 0.1% | |
| Other values (684) | 1064 | 5.3% | |
| (Missing) | 18485 | 92.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 4 | < 0.1% | |
| 10303 | 1 | < 0.1% | |
| 20606 | 34 | 0.2% | |
| 27475 | 1 | < 0.1% | |
| 30910 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 5.2296e+08 | 1 | < 0.1% | |
| 3.4343e+08 | 1 | < 0.1% | |
| 2.6754e+08 | 1 | < 0.1% | |
| 2.2947e+08 | 1 | < 0.1% | |
| 1.6782e+08 | 2 | < 0.1% |
media_vl_folha_coligados_gp
Numeric
| Distinct count | 714 |
|---|---|
| Unique (%) | 3.6% |
| Missing (%) | 92.4% |
| Missing (n) | 18477 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.2033e+07 |
|---|---|
| Minimum | 0 |
| Maximum | 1.3928e+09 |
| Zeros (%) | < 0.1% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 61819 |
| Q1 | 1.8546e+05 |
| Median | 5.2546e+05 |
| Q3 | 2.1551e+06 |
| 95-th percentile | 4.5519e+07 |
| Maximum | 1.3928e+09 |
| Range | 1.3928e+09 |
| Interquartile range | 1.9696e+06 |
Descriptive statistics
| Standard deviation | 7.3986e+07 |
|---|---|
| Coef of variation | 6.1485 |
| Kurtosis | 206.6 |
| Mean | 1.2033e+07 |
| MAD | 1.9443e+07 |
| Skewness | 13.187 |
| Sum | 1.8326e+10 |
| Variance | 5.4739e+15 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 61819 | 127 | 0.6% | |
| 1.2364e+05 | 69 | 0.3% | |
| 1.8546e+05 | 55 | 0.3% | |
| 2.4728e+05 | 39 | 0.2% | |
| 20606 | 32 | 0.2% | |
| 3.091e+05 | 27 | 0.1% | |
| 4.3273e+05 | 26 | 0.1% | |
| 4.9455e+05 | 23 | 0.1% | |
| 3.7092e+05 | 23 | 0.1% | |
| 2.0606e+05 | 21 | 0.1% | |
| Other values (703) | 1081 | 5.4% | |
| (Missing) | 18477 | 92.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 4 | < 0.1% | |
| 10303 | 1 | < 0.1% | |
| 20606 | 32 | 0.2% | |
| 27475 | 1 | < 0.1% | |
| 41213 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1.3928e+09 | 1 | < 0.1% | |
| 1.3424e+09 | 1 | < 0.1% | |
| 1.1771e+09 | 1 | < 0.1% | |
| 7.3495e+08 | 1 | < 0.1% | |
| 6.6447e+08 | 1 | < 0.1% |
meses_ultima_contratacaco
Numeric
| Distinct count | 1023 |
|---|---|
| Unique (%) | 5.1% |
| Missing (%) | 77.6% |
| Missing (n) | 15515 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 44.159 |
|---|---|
| Minimum | 1.9333 |
| Maximum | 5099.1 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1.9333 |
|---|---|
| 5-th percentile | 2.9333 |
| Q1 | 8.9667 |
| Median | 30.333 |
| Q3 | 59.733 |
| 95-th percentile | 108.43 |
| Maximum | 5099.1 |
| Range | 5097.1 |
| Interquartile range | 50.767 |
Descriptive statistics
| Standard deviation | 136.51 |
|---|---|
| Coef of variation | 3.0914 |
| Kurtosis | 1256.6 |
| Mean | 44.159 |
| MAD | 33.711 |
| Skewness | 34.023 |
| Sum | 1.9805e+05 |
| Variance | 18635 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 2.9333 | 128 | 0.6% | |
| 8.0333 | 75 | 0.4% | |
| 5.9667 | 73 | 0.4% | |
| 3.9333 | 71 | 0.4% | |
| 4.9667 | 67 | 0.3% | |
| 8.9667 | 51 | 0.3% | |
| 14.067 | 51 | 0.3% | |
| 24.2 | 50 | 0.2% | |
| 15.1 | 49 | 0.2% | |
| 26.233 | 45 | 0.2% | |
| Other values (1012) | 3825 | 19.1% | |
| (Missing) | 15515 | 77.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1.9333 | 1 | < 0.1% | |
| 1.9667 | 2 | < 0.1% | |
| 2 | 5 | < 0.1% | |
| 2.0333 | 5 | < 0.1% | |
| 2.0667 | 9 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 5099.1 | 3 | < 0.1% | |
| 332.7 | 1 | < 0.1% | |
| 325.53 | 1 | < 0.1% | |
| 321 | 1 | < 0.1% | |
| 307 | 1 | < 0.1% |
min_faturamento_est_coligados
Numeric
| Distinct count | 171 |
|---|---|
| Unique (%) | 0.9% |
| Missing (%) | 86.7% |
| Missing (n) | 17332 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 7.8437e+05 |
|---|---|
| Minimum | 0 |
| Maximum | 2.6072e+08 |
| Zeros (%) | 0.1% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 50000 |
| Q1 | 2.1e+05 |
| Median | 2.1e+05 |
| Q3 | 2.1e+05 |
| 95-th percentile | 1.4837e+06 |
| Maximum | 2.6072e+08 |
| Range | 2.6072e+08 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 6.4999e+06 |
|---|---|
| Coef of variation | 8.2868 |
| Kurtosis | 1062.1 |
| Mean | 7.8437e+05 |
| MAD | 9.8931e+05 |
| Skewness | 29.558 |
| Sum | 2.0927e+09 |
| Variance | 4.2249e+13 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 2.1e+05 | 1549 | 7.7% | |
| 1.2364e+05 | 163 | 0.8% | |
| 9.3e+05 | 113 | 0.6% | |
| 1.8546e+05 | 105 | 0.5% | |
| 3.7092e+05 | 64 | 0.3% | |
| 41213 | 55 | 0.3% | |
| 50000 | 50 | 0.2% | |
| 2.4728e+05 | 40 | 0.2% | |
| 2.0606e+05 | 32 | 0.2% | |
| 5.5637e+05 | 30 | 0.1% | |
| Other values (160) | 467 | 2.3% | |
| (Missing) | 17332 | 86.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 10 | 0.1% | |
| 900 | 1 | < 0.1% | |
| 1899.1 | 1 | < 0.1% | |
| 3832.9 | 1 | < 0.1% | |
| 6000 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 2.6072e+08 | 1 | < 0.1% | |
| 1.4589e+08 | 1 | < 0.1% | |
| 6.1253e+07 | 2 | < 0.1% | |
| 4.9301e+07 | 1 | < 0.1% | |
| 4.7704e+07 | 1 | < 0.1% |
min_faturamento_est_coligados_gp
Numeric
| Distinct count | 234 |
|---|---|
| Unique (%) | 1.2% |
| Missing (%) | 86.7% |
| Missing (n) | 17332 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.265e+06 |
|---|---|
| Minimum | 900 |
| Maximum | 8.1945e+08 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 900 |
|---|---|
| 5-th percentile | 82426 |
| Q1 | 2.1e+05 |
| Median | 2.1e+05 |
| Q3 | 3.091e+05 |
| 95-th percentile | 2.0328e+06 |
| Maximum | 8.1945e+08 |
| Range | 8.1945e+08 |
| Interquartile range | 99096 |
Descriptive statistics
| Standard deviation | 1.7359e+07 |
|---|---|
| Coef of variation | 13.722 |
| Kurtosis | 1872.9 |
| Mean | 1.265e+06 |
| MAD | 1.805e+06 |
| Skewness | 40.941 |
| Sum | 3.3751e+09 |
| Variance | 3.0133e+14 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 2.1e+05 | 1498 | 7.5% | |
| 1.2364e+05 | 156 | 0.8% | |
| 9.3e+05 | 104 | 0.5% | |
| 1.8546e+05 | 90 | 0.4% | |
| 3.7092e+05 | 53 | 0.3% | |
| 50000 | 51 | 0.3% | |
| 4.2e+05 | 46 | 0.2% | |
| 41213 | 44 | 0.2% | |
| 2.4728e+05 | 38 | 0.2% | |
| 5.5637e+05 | 24 | 0.1% | |
| Other values (223) | 564 | 2.8% | |
| (Missing) | 17332 | 86.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 900 | 1 | < 0.1% | |
| 3832.9 | 1 | < 0.1% | |
| 6000 | 1 | < 0.1% | |
| 12129 | 1 | < 0.1% | |
| 19079 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 8.1945e+08 | 1 | < 0.1% | |
| 2.6072e+08 | 1 | < 0.1% | |
| 1.4589e+08 | 1 | < 0.1% | |
| 8.1346e+07 | 2 | < 0.1% | |
| 7.2741e+07 | 1 | < 0.1% |
min_filiais_coligados
Numeric
| Distinct count | 29 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 96.0% |
| Missing (n) | 19201 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 3.1952 |
|---|---|
| Minimum | 1 |
| Maximum | 314 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 7 |
| Maximum | 314 |
| Range | 313 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 13.58 |
|---|---|
| Coef of variation | 4.25 |
| Kurtosis | 357.82 |
| Mean | 3.1952 |
| MAD | 3.3214 |
| Skewness | 17.007 |
| Sum | 2553 |
| Variance | 184.41 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 548 | 2.7% | |
| 2 | 95 | 0.5% | |
| 3 | 53 | 0.3% | |
| 4 | 39 | 0.2% | |
| 5 | 11 | 0.1% | |
| 6 | 11 | 0.1% | |
| 23 | 5 | < 0.1% | |
| 7 | 4 | < 0.1% | |
| 44 | 3 | < 0.1% | |
| 15 | 3 | < 0.1% | |
| Other values (18) | 27 | 0.1% | |
| (Missing) | 19201 | 96.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 548 | 2.7% | |
| 2 | 95 | 0.5% | |
| 3 | 53 | 0.3% | |
| 4 | 39 | 0.2% | |
| 5 | 11 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 314 | 1 | < 0.1% | |
| 126 | 1 | < 0.1% | |
| 106 | 1 | < 0.1% | |
| 67 | 2 | < 0.1% | |
| 48 | 1 | < 0.1% |
min_funcionarios_coligados_gp
Numeric
| Distinct count | 103 |
|---|---|
| Unique (%) | 0.5% |
| Missing (%) | 91.3% |
| Missing (n) | 18262 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 25.666 |
|---|---|
| Minimum | 0 |
| Maximum | 10836 |
| Zeros (%) | 2.9% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 2 |
| Q3 | 6 |
| 95-th percentile | 42 |
| Maximum | 10836 |
| Range | 10836 |
| Interquartile range | 6 |
Descriptive statistics
| Standard deviation | 317.39 |
|---|---|
| Coef of variation | 12.366 |
| Kurtosis | 879.84 |
| Mean | 25.666 |
| MAD | 41.498 |
| Skewness | 28.164 |
| Sum | 44607 |
| Variance | 1.0074e+05 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 577 | 2.9% | |
| 1 | 278 | 1.4% | |
| 2 | 161 | 0.8% | |
| 3 | 113 | 0.6% | |
| 4 | 75 | 0.4% | |
| 5 | 60 | 0.3% | |
| 6 | 54 | 0.3% | |
| 7 | 48 | 0.2% | |
| 8 | 42 | 0.2% | |
| 10 | 37 | 0.2% | |
| Other values (92) | 293 | 1.5% | |
| (Missing) | 18262 | 91.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 577 | 2.9% | |
| 1 | 278 | 1.4% | |
| 2 | 161 | 0.8% | |
| 3 | 113 | 0.6% | |
| 4 | 75 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 10836 | 1 | < 0.1% | |
| 6548 | 1 | < 0.1% | |
| 2451 | 1 | < 0.1% | |
| 1340 | 1 | < 0.1% | |
| 963 | 1 | < 0.1% |
min_meses_servicos
Numeric
| Distinct count | 798 |
|---|---|
| Unique (%) | 4.0% |
| Missing (%) | 83.5% |
| Missing (n) | 16703 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 37.116 |
|---|---|
| Minimum | 1.9333 |
| Maximum | 5099.1 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1.9333 |
|---|---|
| 5-th percentile | 2.7667 |
| Q1 | 6.9667 |
| Median | 23.467 |
| Q3 | 47.567 |
| 95-th percentile | 94.7 |
| Maximum | 5099.1 |
| Range | 5097.1 |
| Interquartile range | 40.6 |
Descriptive statistics
| Standard deviation | 130.34 |
|---|---|
| Coef of variation | 3.5118 |
| Kurtosis | 1379.9 |
| Mean | 37.116 |
| MAD | 30.588 |
| Skewness | 35.627 |
| Sum | 1.2237e+05 |
| Variance | 16989 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 2.9333 | 124 | 0.6% | |
| 5.9667 | 71 | 0.4% | |
| 3.9333 | 70 | 0.4% | |
| 4.9667 | 66 | 0.3% | |
| 8.0333 | 64 | 0.3% | |
| 8.9667 | 50 | 0.2% | |
| 24.2 | 40 | 0.2% | |
| 28.3 | 40 | 0.2% | |
| 14.067 | 39 | 0.2% | |
| 6.9667 | 39 | 0.2% | |
| Other values (787) | 2694 | 13.5% | |
| (Missing) | 16703 | 83.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1.9333 | 1 | < 0.1% | |
| 1.9667 | 2 | < 0.1% | |
| 2 | 5 | < 0.1% | |
| 2.0333 | 5 | < 0.1% | |
| 2.0667 | 9 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 5099.1 | 2 | < 0.1% | |
| 528.2 | 1 | < 0.1% | |
| 302.57 | 1 | < 0.1% | |
| 300.23 | 1 | < 0.1% | |
| 291.07 | 1 | < 0.1% |
min_meses_servicos_all
Highly correlated
This variable is highly correlated with meses_ultima_contratacaco and should be ignored for analysis
| Correlation | 0.9754 |
|---|
min_vl_folha_coligados
Numeric
| Distinct count | 168 |
|---|---|
| Unique (%) | 0.8% |
| Missing (%) | 92.4% |
| Missing (n) | 18485 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.6461e+06 |
|---|---|
| Minimum | 0 |
| Maximum | 3.4343e+08 |
| Zeros (%) | 0.2% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 20606 |
| Q1 | 61819 |
| Median | 1.8546e+05 |
| Q3 | 5.1516e+05 |
| 95-th percentile | 2.8911e+06 |
| Maximum | 3.4343e+08 |
| Range | 3.4343e+08 |
| Interquartile range | 4.5334e+05 |
Descriptive statistics
| Standard deviation | 1.2659e+07 |
|---|---|
| Coef of variation | 7.6904 |
| Kurtosis | 398.22 |
| Mean | 1.6461e+06 |
| MAD | 2.491e+06 |
| Skewness | 17.742 |
| Sum | 2.4938e+09 |
| Variance | 1.6024e+14 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 61819 | 287 | 1.4% | |
| 1.2364e+05 | 156 | 0.8% | |
| 1.8546e+05 | 99 | 0.5% | |
| 20606 | 84 | 0.4% | |
| 2.4728e+05 | 59 | 0.3% | |
| 3.091e+05 | 45 | 0.2% | |
| 1.0303e+05 | 38 | 0.2% | |
| 4.3273e+05 | 34 | 0.2% | |
| 0 | 33 | 0.2% | |
| 3.7092e+05 | 32 | 0.2% | |
| Other values (157) | 648 | 3.2% | |
| (Missing) | 18485 | 92.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 33 | 0.2% | |
| 20606 | 84 | 0.4% | |
| 41213 | 28 | 0.1% | |
| 61819 | 287 | 1.4% | |
| 82426 | 28 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 3.4343e+08 | 1 | < 0.1% | |
| 1.5482e+08 | 1 | < 0.1% | |
| 1.3491e+08 | 4 | < 0.1% | |
| 6.0397e+07 | 5 | < 0.1% | |
| 5.8357e+07 | 1 | < 0.1% |
min_vl_folha_coligados_gp
Highly correlated
This variable is highly correlated with min_funcionarios_coligados_gp and should be ignored for analysis
| Correlation | 0.93663 |
|---|
natureza_juridica_macro
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| OUTROS | |
|---|---|
| ENTIDADES EMPRESARIAIS | |
| ENTIDADES SEM FINS LUCRATIVOS | 1610 |
| Other values (3) | 250 |
| Value | Count | Frequency (%) | |
| OUTROS | 13960 | 69.8% | |
| ENTIDADES EMPRESARIAIS | 4180 | 20.9% | |
| ENTIDADES SEM FINS LUCRATIVOS | 1610 | 8.1% | |
| ADMINISTRACAO PUBLICA | 120 | 0.6% | |
| CARGO POLITICO | 80 | 0.4% | |
| PESSOAS FISICAS | 50 | 0.2% |
| Max length | 29 |
|---|---|
| Mean length | 11.34 |
| Min length | 6 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nm_divisao
Categorical
| Distinct count | 86 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 0.4% |
| Missing (n) | 74 |
| COMERCIO VAREJISTA | |
|---|---|
| ATIVIDADES DE ORGANIZACOES ASSOCIATIVAS | 1652 |
| ALIMENTACAO | 1209 |
| Other values (82) |
| Value | Count | Frequency (%) | |
| COMERCIO VAREJISTA | 7492 | 37.5% | |
| ATIVIDADES DE ORGANIZACOES ASSOCIATIVAS | 1652 | 8.3% | |
| ALIMENTACAO | 1209 | 6.0% | |
| COMERCIO E REPARACAO DE VEICULOS AUTOMOTORES E MOTOCICLETAS | 962 | 4.8% | |
| COMERCIO POR ATACADO EXCETO VEICULOS AUTOMOTORES E MOTOCICLETAS | 692 | 3.5% | |
| OUTRAS ATIVIDADES DE SERVICOS PESSOAIS | 658 | 3.3% | |
| SERVICOS ESPECIALIZADOS PARA CONSTRUCAO | 645 | 3.2% | |
| TRANSPORTE TERRESTRE | 492 | 2.5% | |
| EDUCACAO | 463 | 2.3% | |
| SERVICOS DE ESCRITORIO DE APOIO ADMINISTRATIVO E OUTROS SERVICOS PRESTADOS PRINCIPALMENTE AS EMPRESAS | 431 | 2.2% | |
| Other values (75) | 5230 | 26.2% |
| Max length | 120 |
|---|---|
| Mean length | 33.007 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nm_meso_regiao
Categorical
| Distinct count | 20 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 12.5% |
| Missing (n) | 2505 |
| CENTRO AMAZONENSE | |
|---|---|
| NORTE MARANHENSE | |
| LESTE POTIGUAR | |
| Other values (16) | |
| (Missing) |
| Value | Count | Frequency (%) | |
| CENTRO AMAZONENSE | 3098 | 15.5% | |
| NORTE MARANHENSE | 2641 | 13.2% | |
| LESTE POTIGUAR | 2557 | 12.8% | |
| CENTRO NORTE PIAUIENSE | 1824 | 9.1% | |
| OESTE MARANHENSE | 1119 | 5.6% | |
| OESTE POTIGUAR | 900 | 4.5% | |
| LESTE MARANHENSE | 816 | 4.1% | |
| VALE DO ACRE | 719 | 3.6% | |
| CENTRO MARANHENSE | 624 | 3.1% | |
| SUDOESTE PIAUIENSE | 514 | 2.6% | |
| Other values (9) | 2683 | 13.4% | |
| (Missing) | 2505 | 12.5% |
| Max length | 22 |
|---|---|
| Mean length | 14.606 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nm_micro_regiao
Categorical
| Distinct count | 74 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 12.5% |
| Missing (n) | 2505 |
| MANAUS | |
|---|---|
| NATAL | 2038 |
| AGLOMERACAO URBANA DE SAO LUIS | 1980 |
| Other values (70) | |
| (Missing) |
| Value | Count | Frequency (%) | |
| MANAUS | 2622 | 13.1% | |
| NATAL | 2038 | 10.2% | |
| AGLOMERACAO URBANA DE SAO LUIS | 1980 | 9.9% | |
| TERESINA | 1443 | 7.2% | |
| IMPERATRIZ | 648 | 3.2% | |
| RIO BRANCO | 617 | 3.1% | |
| MOSSORO | 457 | 2.3% | |
| PINDARE | 348 | 1.7% | |
| MACAIBA | 316 | 1.6% | |
| CAXIAS | 314 | 1.6% | |
| Other values (63) | 6712 | 33.6% | |
| (Missing) | 2505 | 12.5% |
| Max length | 33 |
|---|---|
| Mean length | 11.069 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nm_segmento
Categorical
| Distinct count | 22 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.4% |
| Missing (n) | 74 |
| COMERCIO; REPARACAO DE VEICULOS AUTOMOTORES E MOTOCICLETAS | |
|---|---|
| OUTRAS ATIVIDADES DE SERVICOS | |
| INDUSTRIAS DE TRANSFORMACAO | 1343 |
| Other values (18) |
| Value | Count | Frequency (%) | |
| COMERCIO; REPARACAO DE VEICULOS AUTOMOTORES E MOTOCICLETAS | 9146 | 45.7% | |
| OUTRAS ATIVIDADES DE SERVICOS | 2563 | 12.8% | |
| INDUSTRIAS DE TRANSFORMACAO | 1343 | 6.7% | |
| ALOJAMENTO E ALIMENTACAO | 1316 | 6.6% | |
| CONSTRUCAO | 1132 | 5.7% | |
| ATIVIDADES ADMINISTRATIVAS E SERVICOS COMPLEMENTARES | 937 | 4.7% | |
| ATIVIDADES PROFISSIONAIS CIENTIFICAS E TECNICAS | 784 | 3.9% | |
| TRANSPORTE ARMAZENAGEM E CORREIO | 666 | 3.3% | |
| EDUCACAO | 463 | 2.3% | |
| SAUDE HUMANA E SERVICOS SOCIAIS | 451 | 2.3% | |
| Other values (11) | 1125 | 5.6% |
| Max length | 65 |
|---|---|
| Mean length | 42.612 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nu_meses_rescencia
Numeric
| Distinct count | 39 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 9.5% |
| Missing (n) | 1905 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 24.976 |
|---|---|
| Minimum | 7 |
| Maximum | 54 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 22 |
| Median | 23 |
| Q3 | 25 |
| 95-th percentile | 48 |
| Maximum | 54 |
| Range | 47 |
| Interquartile range | 3 |
Descriptive statistics
| Standard deviation | 9.651 |
|---|---|
| Coef of variation | 0.38641 |
| Kurtosis | 1.9685 |
| Mean | 24.976 |
| MAD | 5.8477 |
| Skewness | 1.2694 |
| Sum | 4.5195e+05 |
| Variance | 93.142 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 23 | 4744 | 23.7% | |
| 22 | 3838 | 19.2% | |
| 24 | 1915 | 9.6% | |
| 48 | 1140 | 5.7% | |
| 25 | 1101 | 5.5% | |
| 26 | 1079 | 5.4% | |
| 27 | 811 | 4.1% | |
| 21 | 506 | 2.5% | |
| 7 | 415 | 2.1% | |
| 9 | 405 | 2.0% | |
| Other values (28) | 2141 | 10.7% | |
| (Missing) | 1905 | 9.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 7 | 415 | 2.1% | |
| 8 | 249 | 1.2% | |
| 9 | 405 | 2.0% | |
| 10 | 279 | 1.4% | |
| 11 | 199 | 1.0% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 54 | 75 | 0.4% | |
| 53 | 3 | < 0.1% | |
| 52 | 39 | 0.2% | |
| 51 | 2 | < 0.1% | |
| 50 | 296 | 1.5% |
percent_func_genero_fem
Numeric
| Distinct count | 314 |
|---|---|
| Unique (%) | 1.6% |
| Missing (%) | 83.6% |
| Missing (n) | 16721 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 44.353 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros (%) | 4.8% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 40 |
| Q3 | 84.675 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range | 84.675 |
Descriptive statistics
| Standard deviation | 39.175 |
|---|---|
| Coef of variation | 0.88325 |
| Kurtosis | -1.4666 |
| Mean | 44.353 |
| MAD | 34.873 |
| Skewness | 0.25028 |
| Sum | 1.4544e+05 |
| Variance | 1534.7 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 961 | 4.8% | |
| 100 | 755 | 3.8% | |
| 50 | 281 | 1.4% | |
| 33.33 | 116 | 0.6% | |
| 25 | 102 | 0.5% | |
| 66.67 | 88 | 0.4% | |
| 75 | 54 | 0.3% | |
| 40 | 44 | 0.2% | |
| 20 | 42 | 0.2% | |
| 60 | 37 | 0.2% | |
| Other values (303) | 799 | 4.0% | |
| (Missing) | 16721 | 83.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 961 | 4.8% | |
| 0.89 | 1 | < 0.1% | |
| 0.95 | 1 | < 0.1% | |
| 1.19 | 1 | < 0.1% | |
| 1.54 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 100 | 755 | 3.8% | |
| 94.44 | 2 | < 0.1% | |
| 94.12 | 2 | < 0.1% | |
| 93.75 | 2 | < 0.1% | |
| 92.86 | 3 | < 0.1% |
percent_func_genero_masc
Numeric
| Distinct count | 314 |
|---|---|
| Unique (%) | 1.6% |
| Missing (%) | 83.6% |
| Missing (n) | 16721 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 55.647 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros (%) | 3.8% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 15.325 |
| Median | 60 |
| Q3 | 100 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range | 84.675 |
Descriptive statistics
| Standard deviation | 39.175 |
|---|---|
| Coef of variation | 0.704 |
| Kurtosis | -1.4666 |
| Mean | 55.647 |
| MAD | 34.873 |
| Skewness | -0.25028 |
| Sum | 1.8246e+05 |
| Variance | 1534.7 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 100 | 961 | 4.8% | |
| 0 | 755 | 3.8% | |
| 50 | 281 | 1.4% | |
| 66.67 | 116 | 0.6% | |
| 75 | 102 | 0.5% | |
| 33.33 | 88 | 0.4% | |
| 25 | 54 | 0.3% | |
| 60 | 44 | 0.2% | |
| 80 | 42 | 0.2% | |
| 40 | 37 | 0.2% | |
| Other values (303) | 799 | 4.0% | |
| (Missing) | 16721 | 83.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 755 | 3.8% | |
| 5.56 | 2 | < 0.1% | |
| 5.88 | 2 | < 0.1% | |
| 6.25 | 2 | < 0.1% | |
| 7.14 | 3 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 100 | 961 | 4.8% | |
| 99.11 | 1 | < 0.1% | |
| 99.05 | 1 | < 0.1% | |
| 98.81 | 1 | < 0.1% | |
| 98.46 | 1 | < 0.1% |
qt_admitidos
Numeric
| Distinct count | 299 |
|---|---|
| Unique (%) | 1.5% |
| Missing (%) | 77.6% |
| Missing (n) | 15515 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 43.8 |
|---|---|
| Minimum | 1 |
| Maximum | 12447 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| Median | 6 |
| Q3 | 19 |
| 95-th percentile | 128.8 |
| Maximum | 12447 |
| Range | 12446 |
| Interquartile range | 17 |
Descriptive statistics
| Standard deviation | 322.41 |
|---|---|
| Coef of variation | 7.3609 |
| Kurtosis | 773.39 |
| Mean | 43.8 |
| MAD | 61.394 |
| Skewness | 25.07 |
| Sum | 1.9644e+05 |
| Variance | 1.0395e+05 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 848 | 4.2% | |
| 2 | 470 | 2.4% | |
| 3 | 354 | 1.8% | |
| 4 | 259 | 1.3% | |
| 5 | 210 | 1.1% | |
| 6 | 168 | 0.8% | |
| 7 | 157 | 0.8% | |
| 8 | 135 | 0.7% | |
| 9 | 126 | 0.6% | |
| 10 | 92 | 0.5% | |
| Other values (288) | 1666 | 8.3% | |
| (Missing) | 15515 | 77.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 848 | 4.2% | |
| 2 | 470 | 2.4% | |
| 3 | 354 | 1.8% | |
| 4 | 259 | 1.3% | |
| 5 | 210 | 1.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 12447 | 1 | < 0.1% | |
| 9208 | 1 | < 0.1% | |
| 7849 | 1 | < 0.1% | |
| 6169 | 1 | < 0.1% | |
| 6103 | 1 | < 0.1% |
qt_admitidos_12meses
Numeric
| Distinct count | 60 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 77.6% |
| Missing (n) | 15515 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.8707 |
|---|---|
| Minimum | 0 |
| Maximum | 557 |
| Zeros (%) | 16.0% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 1 |
| 95-th percentile | 7 |
| Maximum | 557 |
| Range | 557 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 12.544 |
|---|---|
| Coef of variation | 6.7056 |
| Kurtosis | 1014.9 |
| Mean | 1.8707 |
| MAD | 2.8594 |
| Skewness | 27.343 |
| Sum | 8390 |
| Variance | 157.35 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 3209 | 16.0% | |
| 1 | 470 | 2.4% | |
| 2 | 216 | 1.1% | |
| 3 | 127 | 0.6% | |
| 4 | 99 | 0.5% | |
| 5 | 59 | 0.3% | |
| 6 | 51 | 0.3% | |
| 7 | 32 | 0.2% | |
| 9 | 25 | 0.1% | |
| 8 | 23 | 0.1% | |
| Other values (49) | 174 | 0.9% | |
| (Missing) | 15515 | 77.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 3209 | 16.0% | |
| 1 | 470 | 2.4% | |
| 2 | 216 | 1.1% | |
| 3 | 127 | 0.6% | |
| 4 | 99 | 0.5% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 557 | 1 | < 0.1% | |
| 331 | 1 | < 0.1% | |
| 248 | 1 | < 0.1% | |
| 163 | 1 | < 0.1% | |
| 151 | 1 | < 0.1% |
qt_alteracao_socio_180d
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
qt_alteracao_socio_365d
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
qt_alteracao_socio_90d
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
qt_alteracao_socio_total
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
qt_art
Highly correlated
This variable is highly correlated with idade_media_coligadas_baixadas and should be ignored for analysis
| Correlation | 1 |
|---|
qt_coligadas
Numeric
| Distinct count | 40 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 89.4% |
| Missing (n) | 17889 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.8745 |
|---|---|
| Minimum | 1 |
| Maximum | 289 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 3 |
| 95-th percentile | 7 |
| Maximum | 289 |
| Range | 288 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 9.2704 |
|---|---|
| Coef of variation | 3.2251 |
| Kurtosis | 524.71 |
| Mean | 2.8745 |
| MAD | 2.4248 |
| Skewness | 20.236 |
| Sum | 6068 |
| Variance | 85.94 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 1176 | 5.9% | |
| 2 | 406 | 2.0% | |
| 3 | 193 | 1.0% | |
| 4 | 107 | 0.5% | |
| 5 | 60 | 0.3% | |
| 6 | 42 | 0.2% | |
| 7 | 28 | 0.1% | |
| 9 | 13 | 0.1% | |
| 8 | 12 | 0.1% | |
| 10 | 10 | 0.1% | |
| Other values (29) | 64 | 0.3% | |
| (Missing) | 17889 | 89.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 1176 | 5.9% | |
| 2 | 406 | 2.0% | |
| 3 | 193 | 1.0% | |
| 4 | 107 | 0.5% | |
| 5 | 60 | 0.3% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 289 | 1 | < 0.1% | |
| 163 | 1 | < 0.1% | |
| 161 | 1 | < 0.1% | |
| 118 | 1 | < 0.1% | |
| 61 | 1 | < 0.1% |
qt_coligados
Highly correlated
This variable is highly correlated with qt_coligadas and should be ignored for analysis
| Correlation | 0.99975 |
|---|
qt_coligados_agropecuaria
Numeric
| Distinct count | 10 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.12731 |
|---|---|
| Minimum | 0 |
| Maximum | 9 |
| Zeros (%) | 12.7% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.64445 |
|---|---|
| Coef of variation | 5.0622 |
| Kurtosis | 71.998 |
| Mean | 0.12731 |
| MAD | 0.23798 |
| Skewness | 7.7178 |
| Sum | 345 |
| Variance | 0.41531 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 2533 | 12.7% | |
| 1 | 107 | 0.5% | |
| 2 | 35 | 0.2% | |
| 3 | 13 | 0.1% | |
| 7 | 10 | 0.1% | |
| 4 | 7 | < 0.1% | |
| 5 | 2 | < 0.1% | |
| 6 | 2 | < 0.1% | |
| 9 | 1 | < 0.1% | |
| (Missing) | 17290 | 86.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 2533 | 12.7% | |
| 1 | 107 | 0.5% | |
| 2 | 35 | 0.2% | |
| 3 | 13 | 0.1% | |
| 4 | 7 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 9 | 1 | < 0.1% | |
| 7 | 10 | 0.1% | |
| 6 | 2 | < 0.1% | |
| 5 | 2 | < 0.1% | |
| 4 | 7 | < 0.1% |
qt_coligados_atividade_alto
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_coligados_atividade_baixo
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_coligados_atividade_inativo
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_coligados_atividade_medio
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_coligados_atividade_mt_baixo
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_coligados_ativo
Highly correlated
This variable is highly correlated with qt_coligados and should be ignored for analysis
| Correlation | 0.99782 |
|---|
qt_coligados_baixada
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| 0 | 2700 |
|---|---|
| 1 | 8 |
| 2 | 2 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 2700 | 13.5% | |
| 1 | 8 | < 0.1% | |
| 2 | 2 | < 0.1% | |
| (Missing) | 17290 | 86.5% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_coligados_ccivil
Numeric
| Distinct count | 26 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.8679 |
|---|---|
| Minimum | 0 |
| Maximum | 409 |
| Zeros (%) | 11.3% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 409 |
| Range | 409 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 11.481 |
|---|---|
| Coef of variation | 13.229 |
| Kurtosis | 794.21 |
| Mean | 0.8679 |
| MAD | 1.4418 |
| Skewness | 26.521 |
| Sum | 2352 |
| Variance | 131.82 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 2251 | 11.3% | |
| 1 | 305 | 1.5% | |
| 2 | 62 | 0.3% | |
| 3 | 33 | 0.2% | |
| 7 | 10 | 0.1% | |
| 4 | 9 | < 0.1% | |
| 6 | 8 | < 0.1% | |
| 5 | 5 | < 0.1% | |
| 10 | 3 | < 0.1% | |
| 8 | 2 | < 0.1% | |
| Other values (15) | 22 | 0.1% | |
| (Missing) | 17290 | 86.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 2251 | 11.3% | |
| 1 | 305 | 1.5% | |
| 2 | 62 | 0.3% | |
| 3 | 33 | 0.2% | |
| 4 | 9 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 409 | 1 | < 0.1% | |
| 274 | 1 | < 0.1% | |
| 241 | 1 | < 0.1% | |
| 130 | 1 | < 0.1% | |
| 128 | 2 | < 0.1% |
qt_coligados_centro
Numeric
| Distinct count | 14 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.17343 |
|---|---|
| Minimum | 0 |
| Maximum | 36 |
| Zeros (%) | 12.7% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 36 |
| Range | 36 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 1.2963 |
|---|---|
| Coef of variation | 7.4747 |
| Kurtosis | 457.85 |
| Mean | 0.17343 |
| MAD | 0.32626 |
| Skewness | 18.552 |
| Sum | 470 |
| Variance | 1.6805 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 2549 | 12.7% | |
| 1 | 79 | 0.4% | |
| 2 | 37 | 0.2% | |
| 3 | 13 | 0.1% | |
| 6 | 10 | 0.1% | |
| 4 | 5 | < 0.1% | |
| 8 | 5 | < 0.1% | |
| 7 | 4 | < 0.1% | |
| 5 | 3 | < 0.1% | |
| 36 | 2 | < 0.1% | |
| Other values (3) | 3 | < 0.1% | |
| (Missing) | 17290 | 86.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 2549 | 12.7% | |
| 1 | 79 | 0.4% | |
| 2 | 37 | 0.2% | |
| 3 | 13 | 0.1% | |
| 4 | 5 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 36 | 2 | < 0.1% | |
| 19 | 1 | < 0.1% | |
| 15 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% | |
| 8 | 5 | < 0.1% |
qt_coligados_comercio
Numeric
| Distinct count | 21 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.94982 |
|---|---|
| Minimum | 0 |
| Maximum | 29 |
| Zeros (%) | 7.1% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 1 |
| 95-th percentile | 4 |
| Maximum | 29 |
| Range | 29 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 1.9071 |
|---|---|
| Coef of variation | 2.0079 |
| Kurtosis | 58.422 |
| Mean | 0.94982 |
| MAD | 0.99678 |
| Skewness | 6.0607 |
| Sum | 2574 |
| Variance | 3.6372 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 1422 | 7.1% | |
| 1 | 808 | 4.0% | |
| 2 | 238 | 1.2% | |
| 3 | 99 | 0.5% | |
| 4 | 42 | 0.2% | |
| 6 | 32 | 0.2% | |
| 5 | 21 | 0.1% | |
| 8 | 15 | 0.1% | |
| 7 | 13 | 0.1% | |
| 20 | 4 | < 0.1% | |
| Other values (10) | 16 | 0.1% | |
| (Missing) | 17290 | 86.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 1422 | 7.1% | |
| 1 | 808 | 4.0% | |
| 2 | 238 | 1.2% | |
| 3 | 99 | 0.5% | |
| 4 | 42 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 29 | 1 | < 0.1% | |
| 28 | 1 | < 0.1% | |
| 20 | 4 | < 0.1% | |
| 18 | 1 | < 0.1% | |
| 17 | 3 | < 0.1% |
qt_coligados_epp
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| 0 | 2708 |
|---|---|
| 1 | 1 |
| 6 | 1 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 2708 | 13.5% | |
| 1 | 1 | < 0.1% | |
| 6 | 1 | < 0.1% | |
| (Missing) | 17290 | 86.5% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_coligados_exterior
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| 0 | 2677 |
|---|---|
| 1 | 18 |
| 2 | 13 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 2677 | 13.4% | |
| 1 | 18 | 0.1% | |
| 2 | 13 | 0.1% | |
| 3 | 2 | < 0.1% | |
| (Missing) | 17290 | 86.5% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_coligados_inapta
Highly correlated
This variable is highly correlated with coligada_mais_antiga_baixada and should be ignored for analysis
| Correlation | 0.937 |
|---|
qt_coligados_industria
Numeric
| Distinct count | 20 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.41328 |
|---|---|
| Minimum | 0 |
| Maximum | 111 |
| Zeros (%) | 11.5% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 111 |
| Range | 111 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 3.5146 |
|---|---|
| Coef of variation | 8.5041 |
| Kurtosis | 746.57 |
| Mean | 0.41328 |
| MAD | 0.69938 |
| Skewness | 25.232 |
| Sum | 1120 |
| Variance | 12.353 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 2293 | 11.5% | |
| 1 | 300 | 1.5% | |
| 2 | 52 | 0.3% | |
| 3 | 20 | 0.1% | |
| 4 | 16 | 0.1% | |
| 5 | 5 | < 0.1% | |
| 10 | 5 | < 0.1% | |
| 6 | 5 | < 0.1% | |
| 12 | 2 | < 0.1% | |
| 14 | 2 | < 0.1% | |
| Other values (9) | 10 | 0.1% | |
| (Missing) | 17290 | 86.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 2293 | 11.5% | |
| 1 | 300 | 1.5% | |
| 2 | 52 | 0.3% | |
| 3 | 20 | 0.1% | |
| 4 | 16 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 111 | 2 | < 0.1% | |
| 48 | 1 | < 0.1% | |
| 43 | 1 | < 0.1% | |
| 28 | 1 | < 0.1% | |
| 26 | 1 | < 0.1% |
qt_coligados_ltda
Numeric
| Distinct count | 12 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.086347 |
|---|---|
| Minimum | 0 |
| Maximum | 29 |
| Zeros (%) | 12.9% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 29 |
| Range | 29 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.77469 |
|---|---|
| Coef of variation | 8.9718 |
| Kurtosis | 802.07 |
| Mean | 0.086347 |
| MAD | 0.16486 |
| Skewness | 24.669 |
| Sum | 234 |
| Variance | 0.60015 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 2587 | 12.9% | |
| 1 | 92 | 0.5% | |
| 2 | 16 | 0.1% | |
| 3 | 5 | < 0.1% | |
| 4 | 3 | < 0.1% | |
| 9 | 2 | < 0.1% | |
| 29 | 1 | < 0.1% | |
| 7 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 16 | 1 | < 0.1% | |
| (Missing) | 17290 | 86.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 2587 | 12.9% | |
| 1 | 92 | 0.5% | |
| 2 | 16 | 0.1% | |
| 3 | 5 | < 0.1% | |
| 4 | 3 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 29 | 1 | < 0.1% | |
| 16 | 1 | < 0.1% | |
| 9 | 2 | < 0.1% | |
| 8 | 1 | < 0.1% | |
| 7 | 1 | < 0.1% |
qt_coligados_matriz
Highly correlated
This variable is highly correlated with qt_coligados_ativo and should be ignored for analysis
| Correlation | 0.99805 |
|---|
qt_coligados_me
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| 0 | 2699 |
|---|---|
| 1 | 11 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 2699 | 13.5% | |
| 1 | 11 | 0.1% | |
| (Missing) | 17290 | 86.5% |
qt_coligados_mei
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| 0 | 2642 |
|---|---|
| 1 | 66 |
| 2 | 2 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 2642 | 13.2% | |
| 1 | 66 | 0.3% | |
| 2 | 2 | < 0.1% | |
| (Missing) | 17290 | 86.5% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_coligados_nordeste
Numeric
| Distinct count | 36 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.9406 |
|---|---|
| Minimum | 0 |
| Maximum | 163 |
| Zeros (%) | 5.2% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 7 |
| Maximum | 163 |
| Range | 163 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 4.901 |
|---|---|
| Coef of variation | 2.5255 |
| Kurtosis | 443.11 |
| Mean | 1.9406 |
| MAD | 2.0463 |
| Skewness | 15.574 |
| Sum | 5259 |
| Variance | 24.02 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 1043 | 5.2% | |
| 1 | 796 | 4.0% | |
| 2 | 344 | 1.7% | |
| 3 | 178 | 0.9% | |
| 4 | 83 | 0.4% | |
| 6 | 51 | 0.3% | |
| 5 | 49 | 0.2% | |
| 7 | 37 | 0.2% | |
| 8 | 27 | 0.1% | |
| 9 | 18 | 0.1% | |
| Other values (25) | 84 | 0.4% | |
| (Missing) | 17290 | 86.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 1043 | 5.2% | |
| 1 | 796 | 4.0% | |
| 2 | 344 | 1.7% | |
| 3 | 178 | 0.9% | |
| 4 | 83 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 163 | 1 | < 0.1% | |
| 44 | 1 | < 0.1% | |
| 40 | 1 | < 0.1% | |
| 39 | 2 | < 0.1% | |
| 36 | 2 | < 0.1% |
qt_coligados_norte
Numeric
| Distinct count | 26 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.81365 |
|---|---|
| Minimum | 0 |
| Maximum | 49 |
| Zeros (%) | 9.1% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 49 |
| Range | 49 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 2.5349 |
|---|---|
| Coef of variation | 3.1154 |
| Kurtosis | 145.28 |
| Mean | 0.81365 |
| MAD | 1.0887 |
| Skewness | 9.9856 |
| Sum | 2205 |
| Variance | 6.4256 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 1813 | 9.1% | |
| 1 | 508 | 2.5% | |
| 2 | 184 | 0.9% | |
| 3 | 70 | 0.4% | |
| 4 | 45 | 0.2% | |
| 5 | 26 | 0.1% | |
| 6 | 18 | 0.1% | |
| 9 | 8 | < 0.1% | |
| 7 | 5 | < 0.1% | |
| 13 | 5 | < 0.1% | |
| Other values (15) | 28 | 0.1% | |
| (Missing) | 17290 | 86.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 1813 | 9.1% | |
| 1 | 508 | 2.5% | |
| 2 | 184 | 0.9% | |
| 3 | 70 | 0.4% | |
| 4 | 45 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 49 | 1 | < 0.1% | |
| 46 | 2 | < 0.1% | |
| 30 | 1 | < 0.1% | |
| 25 | 2 | < 0.1% | |
| 22 | 2 | < 0.1% |
qt_coligados_nula
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_coligados_sa
Highly correlated
This variable is highly correlated with idade_media_coligadas_baixadas and should be ignored for analysis
| Correlation | 0.91043 |
|---|
qt_coligados_serviço
Numeric
| Distinct count | 46 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.624 |
|---|---|
| Minimum | 0 |
| Maximum | 126 |
| Zeros (%) | 5.0% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 8 |
| Maximum | 126 |
| Range | 126 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 9.8838 |
|---|---|
| Coef of variation | 3.7667 |
| Kurtosis | 108.08 |
| Mean | 2.624 |
| MAD | 3.1567 |
| Skewness | 9.7581 |
| Sum | 7111 |
| Variance | 97.69 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 1000 | 5.0% | |
| 1 | 899 | 4.5% | |
| 2 | 310 | 1.6% | |
| 3 | 156 | 0.8% | |
| 4 | 75 | 0.4% | |
| 5 | 56 | 0.3% | |
| 6 | 41 | 0.2% | |
| 7 | 26 | 0.1% | |
| 9 | 25 | 0.1% | |
| 8 | 23 | 0.1% | |
| Other values (35) | 99 | 0.5% | |
| (Missing) | 17290 | 86.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 1000 | 5.0% | |
| 1 | 899 | 4.5% | |
| 2 | 310 | 1.6% | |
| 3 | 156 | 0.8% | |
| 4 | 75 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 126 | 10 | 0.1% | |
| 110 | 1 | < 0.1% | |
| 104 | 1 | < 0.1% | |
| 90 | 2 | < 0.1% | |
| 88 | 2 | < 0.1% |
qt_coligados_sudeste
Highly correlated
This variable is highly correlated with qt_coligados_matriz and should be ignored for analysis
| Correlation | 0.95437 |
|---|
qt_coligados_sul
Highly correlated
This variable is highly correlated with coligada_mais_nova_baixada and should be ignored for analysis
| Correlation | 0.91104 |
|---|
qt_coligados_suspensa
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| 0 | 2709 |
|---|---|
| 1 | 1 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 2709 | 13.5% | |
| 1 | 1 | < 0.1% | |
| (Missing) | 17290 | 86.5% |
qt_desligados
Highly correlated
This variable is highly correlated with qt_admitidos and should be ignored for analysis
| Correlation | 0.93477 |
|---|
qt_desligados_12meses
Numeric
| Distinct count | 60 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 77.6% |
| Missing (n) | 15515 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.936 |
|---|---|
| Minimum | 0 |
| Maximum | 914 |
| Zeros (%) | 15.9% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 1 |
| 95-th percentile | 6 |
| Maximum | 914 |
| Range | 914 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 18.404 |
|---|---|
| Coef of variation | 9.506 |
| Kurtosis | 1565.1 |
| Mean | 1.936 |
| MAD | 2.9695 |
| Skewness | 35.828 |
| Sum | 8683 |
| Variance | 338.7 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 3181 | 15.9% | |
| 1 | 535 | 2.7% | |
| 2 | 235 | 1.2% | |
| 3 | 129 | 0.6% | |
| 4 | 86 | 0.4% | |
| 5 | 62 | 0.3% | |
| 6 | 41 | 0.2% | |
| 7 | 38 | 0.2% | |
| 8 | 26 | 0.1% | |
| 9 | 17 | 0.1% | |
| Other values (49) | 135 | 0.7% | |
| (Missing) | 15515 | 77.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 3181 | 15.9% | |
| 1 | 535 | 2.7% | |
| 2 | 235 | 1.2% | |
| 3 | 129 | 0.6% | |
| 4 | 86 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 914 | 1 | < 0.1% | |
| 557 | 1 | < 0.1% | |
| 328 | 1 | < 0.1% | |
| 249 | 1 | < 0.1% | |
| 233 | 1 | < 0.1% |
qt_ex_funcionarios
Highly correlated
This variable is highly correlated with qt_desligados and should be ignored for analysis
| Correlation | 1 |
|---|
qt_filiais
Numeric
| Distinct count | 135 |
|---|---|
| Unique (%) | 0.7% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 24.286 |
|---|---|
| Minimum | 0 |
| Maximum | 9647 |
| Zeros (%) | 90.9% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 9647 |
| Range | 9647 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 435.24 |
|---|---|
| Coef of variation | 17.921 |
| Kurtosis | 421.85 |
| Mean | 24.286 |
| MAD | 47.437 |
| Skewness | 20.326 |
| Sum | 4.8572e+05 |
| Variance | 1.8944e+05 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 18181 | 90.9% | |
| 1 | 826 | 4.1% | |
| 2 | 254 | 1.3% | |
| 3 | 125 | 0.6% | |
| 4 | 78 | 0.4% | |
| 5 | 44 | 0.2% | |
| 6 | 32 | 0.2% | |
| 7 | 27 | 0.1% | |
| 9 | 24 | 0.1% | |
| 11 | 22 | 0.1% | |
| Other values (125) | 387 | 1.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 18181 | 90.9% | |
| 1 | 826 | 4.1% | |
| 2 | 254 | 1.3% | |
| 3 | 125 | 0.6% | |
| 4 | 78 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 9647 | 10 | 0.1% | |
| 9411 | 14 | 0.1% | |
| 9270 | 11 | 0.1% | |
| 7687 | 7 | < 0.1% | |
| 5491 | 3 | < 0.1% |
qt_funcionarios
Highly correlated
This variable is highly correlated with idade_de_54_a_58 and should be ignored for analysis
| Correlation | 0.9349 |
|---|
qt_funcionarios_12meses
Highly correlated
This variable is highly correlated with qt_funcionarios and should be ignored for analysis
| Correlation | 0.99486 |
|---|
qt_funcionarios_24meses
Highly correlated
This variable is highly correlated with qt_funcionarios_12meses and should be ignored for analysis
| Correlation | 0.97492 |
|---|
qt_funcionarios_coligados
Highly correlated
This variable is highly correlated with coligada_mais_nova_baixada and should be ignored for analysis
| Correlation | 0.94351 |
|---|
qt_funcionarios_coligados_gp
Highly correlated
This variable is highly correlated with max_funcionarios_coligados_gp and should be ignored for analysis
| Correlation | 0.99611 |
|---|
qt_funcionarios_grupo
Numeric
| Distinct count | 355 |
|---|---|
| Unique (%) | 1.8% |
| Missing (%) | 75.3% |
| Missing (n) | 15052 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 8546.4 |
|---|---|
| Minimum | 0 |
| Maximum | 2.6687e+06 |
| Zeros (%) | 6.0% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| Median | 2 |
| Q3 | 8 |
| 95-th percentile | 412 |
| Maximum | 2.6687e+06 |
| Range | 2.6687e+06 |
| Interquartile range | 7 |
Descriptive statistics
| Standard deviation | 1.2367e+05 |
|---|---|
| Coef of variation | 14.471 |
| Kurtosis | 345.6 |
| Mean | 8546.4 |
| MAD | 16644 |
| Skewness | 17.932 |
| Sum | 4.2287e+07 |
| Variance | 1.5295e+10 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 1208 | 6.0% | |
| 1 | 919 | 4.6% | |
| 2 | 488 | 2.4% | |
| 3 | 327 | 1.6% | |
| 4 | 245 | 1.2% | |
| 5 | 201 | 1.0% | |
| 6 | 137 | 0.7% | |
| 7 | 133 | 0.7% | |
| 8 | 90 | 0.4% | |
| 10 | 68 | 0.3% | |
| Other values (344) | 1132 | 5.7% | |
| (Missing) | 15052 | 75.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 1208 | 6.0% | |
| 1 | 919 | 4.6% | |
| 2 | 488 | 2.4% | |
| 3 | 327 | 1.6% | |
| 4 | 245 | 1.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 2.6687e+06 | 7 | < 0.1% | |
| 1.4844e+06 | 11 | 0.1% | |
| 3.8039e+05 | 2 | < 0.1% | |
| 3.7704e+05 | 1 | < 0.1% | |
| 3.511e+05 | 10 | 0.1% |
qt_ramos_coligados
Highly correlated
This variable is highly correlated with idade_media_coligadas_baixadas and should be ignored for analysis
| Correlation | 0.91284 |
|---|
qt_regioes_coligados
Numeric
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.1967 |
|---|---|
| Minimum | 1 |
| Maximum | 6 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.59473 |
|---|---|
| Coef of variation | 0.49698 |
| Kurtosis | 15.101 |
| Mean | 1.1967 |
| MAD | 0.34226 |
| Skewness | 3.7108 |
| Sum | 3243 |
| Variance | 0.3537 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 2358 | 11.8% | |
| 2 | 244 | 1.2% | |
| 4 | 54 | 0.3% | |
| 3 | 45 | 0.2% | |
| 5 | 8 | < 0.1% | |
| 6 | 1 | < 0.1% | |
| (Missing) | 17290 | 86.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 2358 | 11.8% | |
| 2 | 244 | 1.2% | |
| 3 | 45 | 0.2% | |
| 4 | 54 | 0.3% | |
| 5 | 8 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 6 | 1 | < 0.1% | |
| 5 | 8 | < 0.1% | |
| 4 | 54 | 0.3% | |
| 3 | 45 | 0.2% | |
| 2 | 244 | 1.2% |
qt_socios
Highly correlated
This variable is highly correlated with idade_media_coligadas_baixadas and should be ignored for analysis
| Correlation | 0.93688 |
|---|
qt_socios_coligados
Numeric
| Distinct count | 86 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 86.5% |
| Missing (n) | 17290 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 9.0266 |
|---|---|
| Minimum | 1 |
| Maximum | 989 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 2 |
| Q3 | 4 |
| 95-th percentile | 22 |
| Maximum | 989 |
| Range | 988 |
| Interquartile range | 3 |
Descriptive statistics
| Standard deviation | 46.887 |
|---|---|
| Coef of variation | 5.1943 |
| Kurtosis | 164.56 |
| Mean | 9.0266 |
| MAD | 11.815 |
| Skewness | 11.901 |
| Sum | 24462 |
| Variance | 2198.4 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 1105 | 5.5% | |
| 2 | 559 | 2.8% | |
| 3 | 249 | 1.2% | |
| 4 | 178 | 0.9% | |
| 5 | 102 | 0.5% | |
| 6 | 80 | 0.4% | |
| 7 | 59 | 0.3% | |
| 8 | 43 | 0.2% | |
| 9 | 32 | 0.2% | |
| 11 | 29 | 0.1% | |
| Other values (75) | 274 | 1.4% | |
| (Missing) | 17290 | 86.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 1105 | 5.5% | |
| 2 | 559 | 2.8% | |
| 3 | 249 | 1.2% | |
| 4 | 178 | 0.9% | |
| 5 | 102 | 0.5% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 989 | 1 | < 0.1% | |
| 683 | 1 | < 0.1% | |
| 675 | 1 | < 0.1% | |
| 616 | 1 | < 0.1% | |
| 506 | 10 | 0.1% |
qt_socios_feminino
Numeric
| Distinct count | 13 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 68.8% |
| Missing (n) | 13768 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.0968 |
|---|---|
| Minimum | 1 |
| Maximum | 17 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 17 |
| Range | 16 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.52488 |
|---|---|
| Coef of variation | 0.47858 |
| Kurtosis | 353.79 |
| Mean | 1.0968 |
| MAD | 0.17995 |
| Skewness | 15.271 |
| Sum | 6835 |
| Variance | 0.2755 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 5795 | 29.0% | |
| 2 | 368 | 1.8% | |
| 3 | 43 | 0.2% | |
| 4 | 11 | 0.1% | |
| 5 | 5 | < 0.1% | |
| 15 | 2 | < 0.1% | |
| 9 | 2 | < 0.1% | |
| 6 | 2 | < 0.1% | |
| 11 | 1 | < 0.1% | |
| 10 | 1 | < 0.1% | |
| Other values (2) | 2 | < 0.1% | |
| (Missing) | 13768 | 68.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 5795 | 29.0% | |
| 2 | 368 | 1.8% | |
| 3 | 43 | 0.2% | |
| 4 | 11 | 0.1% | |
| 5 | 5 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 17 | 1 | < 0.1% | |
| 15 | 2 | < 0.1% | |
| 11 | 1 | < 0.1% | |
| 10 | 1 | < 0.1% | |
| 9 | 2 | < 0.1% |
qt_socios_masculino
Highly correlated
This variable is highly correlated with qt_socios and should be ignored for analysis
| Correlation | 0.97996 |
|---|
qt_socios_pep
Highly correlated
This variable is highly correlated with qt_socios_masculino and should be ignored for analysis
| Correlation | 0.99002 |
|---|
qt_socios_pf
Highly correlated
This variable is highly correlated with qt_socios_pep and should be ignored for analysis
| Correlation | 0.98016 |
|---|
qt_socios_pj
Numeric
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 24.8% |
| Missing (n) | 4959 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.018283 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros (%) | 74.3% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.18196 |
|---|---|
| Coef of variation | 9.9523 |
| Kurtosis | 260.16 |
| Mean | 0.018283 |
| MAD | 0.036112 |
| Skewness | 13.395 |
| Sum | 275 |
| Variance | 0.03311 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 14854 | 74.3% | |
| 1 | 114 | 0.6% | |
| 2 | 63 | 0.3% | |
| 3 | 8 | < 0.1% | |
| 7 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| (Missing) | 4959 | 24.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 14854 | 74.3% | |
| 1 | 114 | 0.6% | |
| 2 | 63 | 0.3% | |
| 3 | 8 | < 0.1% | |
| 4 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 7 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 3 | 8 | < 0.1% | |
| 2 | 63 | 0.3% | |
| 1 | 114 | 0.6% |
qt_socios_pj_ativos
Highly correlated
This variable is highly correlated with qt_socios_pj and should be ignored for analysis
| Correlation | 0.96123 |
|---|
qt_socios_pj_baixados
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 99.1% |
| Missing (n) | 19813 |
| 0 | 185 |
|---|---|
| 1 | 2 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 185 | 0.9% | |
| 1 | 2 | < 0.1% | |
| (Missing) | 19813 | 99.1% |
qt_socios_pj_inaptos
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 99.1% |
| Missing (n) | 19813 |
| 0 | 185 |
|---|---|
| 1 | 1 |
| 2 | 1 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 185 | 0.9% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| (Missing) | 19813 | 99.1% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_socios_pj_nulos
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 99.1% |
| Missing (n) | 19813 |
| 0 | 186 |
|---|---|
| 1 | 1 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 186 | 0.9% | |
| 1 | 1 | < 0.1% | |
| (Missing) | 19813 | 99.1% |
qt_socios_pj_suspensos
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_socios_st_regular
Highly correlated
This variable is highly correlated with qt_socios_pf and should be ignored for analysis
| Correlation | 0.98725 |
|---|
qt_socios_st_suspensa
Highly correlated
This variable is highly correlated with min_filiais_coligados and should be ignored for analysis
| Correlation | 1 |
|---|
qt_ufs_coligados
Highly correlated
This variable is highly correlated with idade_media_coligadas_baixadas and should be ignored for analysis
| Correlation | 0.92173 |
|---|
setor
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.4% |
| Missing (n) | 74 |
| COMERCIO | |
|---|---|
| SERVIÇO | |
| INDUSTRIA | 1280 |
| Other values (2) | 1280 |
| Value | Count | Frequency (%) | |
| COMERCIO | 9146 | 45.7% | |
| SERVIÇO | 8220 | 41.1% | |
| INDUSTRIA | 1280 | 6.4% | |
| CONSTRUÇÃO CIVIL | 1132 | 5.7% | |
| AGROPECUARIA | 148 | 0.7% | |
| (Missing) | 74 | 0.4% |
| Max length | 16 |
|---|---|
| Mean length | 8.1169 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
sg_uf
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| MA | |
|---|---|
| RN | |
| AM | |
| Other values (3) |
| Value | Count | Frequency (%) | |
| MA | 5563 | 27.8% | |
| RN | 4293 | 21.5% | |
| AM | 3535 | 17.7% | |
| PI | 3279 | 16.4% | |
| RO | 2419 | 12.1% | |
| AC | 911 | 4.6% |
| Max length | 2 |
|---|---|
| Mean length | 2 |
| Min length | 2 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
sg_uf_matriz
Categorical
| Distinct count | 27 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.4% |
| Missing (n) | 74 |
| MA | |
|---|---|
| RN | |
| AM | |
| Other values (23) |
| Value | Count | Frequency (%) | |
| MA | 5446 | 27.2% | |
| RN | 4222 | 21.1% | |
| AM | 3440 | 17.2% | |
| PI | 3225 | 16.1% | |
| RO | 2357 | 11.8% | |
| AC | 884 | 4.4% | |
| SP | 116 | 0.6% | |
| CE | 44 | 0.2% | |
| DF | 40 | 0.2% | |
| RJ | 25 | 0.1% | |
| Other values (16) | 127 | 0.6% | |
| (Missing) | 74 | 0.4% |
| Max length | 3 |
|---|---|
| Mean length | 2.0037 |
| Min length | 2 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
sum_faturamento_estimado_coligadas
Highly correlated
This variable is highly correlated with max_vl_folha_coligados and should be ignored for analysis
| Correlation | 0.9227 |
|---|
total
Highly correlated
This variable is highly correlated with qt_funcionarios_24meses and should be ignored for analysis
| Correlation | 0.96062 |
|---|
total_filiais_coligados
Highly correlated
This variable is highly correlated with max_vl_folha_coligados and should be ignored for analysis
| Correlation | 0.92829 |
|---|
tx_crescimento_12meses
Numeric
| Distinct count | 287 |
|---|---|
| Unique (%) | 1.4% |
| Missing (%) | 84.1% |
| Missing (n) | 16819 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.3591 |
|---|---|
| Minimum | -100 |
| Maximum | 8566.7 |
| Zeros (%) | 9.7% |
Quantile statistics
| Minimum | -100 |
|---|---|
| 5-th percentile | -66.667 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 50 |
| Maximum | 8566.7 |
| Range | 8666.7 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 158.8 |
|---|---|
| Coef of variation | 67.314 |
| Kurtosis | 2662.7 |
| Mean | 2.3591 |
| MAD | 22.46 |
| Skewness | 49.472 |
| Sum | 7504.3 |
| Variance | 25218 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 1933 | 9.7% | |
| -100 | 124 | 0.6% | |
| -50 | 83 | 0.4% | |
| -33.333 | 70 | 0.4% | |
| 100 | 49 | 0.2% | |
| 50 | 47 | 0.2% | |
| -25 | 44 | 0.2% | |
| 33.333 | 38 | 0.2% | |
| -16.667 | 33 | 0.2% | |
| 20 | 30 | 0.1% | |
| Other values (276) | 730 | 3.6% | |
| (Missing) | 16819 | 84.1% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| -100 | 124 | 0.6% | |
| -97.826 | 1 | < 0.1% | |
| -95.69 | 1 | < 0.1% | |
| -93.827 | 1 | < 0.1% | |
| -92.308 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 8566.7 | 1 | < 0.1% | |
| 700 | 1 | < 0.1% | |
| 600 | 1 | < 0.1% | |
| 500 | 2 | < 0.1% | |
| 480 | 1 | < 0.1% |
tx_crescimento_24meses
Numeric
| Distinct count | 423 |
|---|---|
| Unique (%) | 2.1% |
| Missing (%) | 83.9% |
| Missing (n) | 16772 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | -12.264 |
|---|---|
| Minimum | -100 |
| Maximum | 6700 |
| Zeros (%) | 5.9% |
Quantile statistics
| Minimum | -100 |
|---|---|
| 5-th percentile | -100 |
| Q1 | -45.175 |
| Median | -0.52502 |
| Q3 | 0 |
| 95-th percentile | 66.083 |
| Maximum | 6700 |
| Range | 6800 |
| Interquartile range | 45.175 |
Descriptive statistics
| Standard deviation | 136.26 |
|---|---|
| Coef of variation | -11.11 |
| Kurtosis | 1828.6 |
| Mean | -12.264 |
| MAD | 39.516 |
| Skewness | 37.678 |
| Sum | -39589 |
| Variance | 18566 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 1185 | 5.9% | |
| -100 | 349 | 1.7% | |
| -50 | 163 | 0.8% | |
| -33.333 | 143 | 0.7% | |
| -25 | 71 | 0.4% | |
| -66.667 | 61 | 0.3% | |
| 100 | 59 | 0.3% | |
| -20 | 58 | 0.3% | |
| 50 | 51 | 0.3% | |
| -40 | 37 | 0.2% | |
| Other values (412) | 1051 | 5.3% | |
| (Missing) | 16772 | 83.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| -100 | 349 | 1.7% | |
| -99.216 | 1 | < 0.1% | |
| -98.164 | 1 | < 0.1% | |
| -98.039 | 1 | < 0.1% | |
| -96.835 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 6700 | 1 | < 0.1% | |
| 1000 | 2 | < 0.1% | |
| 700 | 2 | < 0.1% | |
| 600 | 2 | < 0.1% | |
| 500 | 6 | < 0.1% |
tx_rotatividade
Numeric
| Distinct count | 311 |
|---|---|
| Unique (%) | 1.6% |
| Missing (%) | 77.6% |
| Missing (n) | 15515 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 9.2241 |
|---|---|
| Minimum | 0 |
| Maximum | 320 |
| Zeros (%) | 17.4% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 52.29 |
| Maximum | 320 |
| Range | 320 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 26.36 |
|---|---|
| Coef of variation | 2.8577 |
| Kurtosis | 31.835 |
| Mean | 9.2241 |
| MAD | 14.452 |
| Skewness | 4.9103 |
| Sum | 41370 |
| Variance | 694.85 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 3489 | 17.4% | |
| 40 | 52 | 0.3% | |
| 28.571 | 51 | 0.3% | |
| 22.222 | 39 | 0.2% | |
| 18.182 | 38 | 0.2% | |
| 33.333 | 37 | 0.2% | |
| 66.667 | 27 | 0.1% | |
| 100 | 25 | 0.1% | |
| 15.385 | 25 | 0.1% | |
| 25 | 23 | 0.1% | |
| Other values (300) | 679 | 3.4% | |
| (Missing) | 15515 | 77.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 3489 | 17.4% | |
| 1.2195 | 1 | < 0.1% | |
| 1.3986 | 1 | < 0.1% | |
| 1.5748 | 1 | < 0.1% | |
| 1.9417 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 320 | 1 | < 0.1% | |
| 311.11 | 1 | < 0.1% | |
| 266.67 | 1 | < 0.1% | |
| 263.83 | 1 | < 0.1% | |
| 250 | 1 | < 0.1% |
vl_faturamento_estimado_aux
Highly correlated
This variable is highly correlated with total and should be ignored for analysis
| Correlation | 0.97315 |
|---|
vl_faturamento_estimado_grupo_aux
Numeric
| Distinct count | 949 |
|---|---|
| Unique (%) | 4.7% |
| Missing (%) | 5.8% |
| Missing (n) | 1154 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.7649e+08 |
|---|---|
| Minimum | 41213 |
| Maximum | 2.2276e+11 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 41213 |
|---|---|
| 5-th percentile | 50000 |
| Q1 | 1.8546e+05 |
| Median | 2.1e+05 |
| Q3 | 2.1e+05 |
| 95-th percentile | 2.5964e+06 |
| Maximum | 2.2276e+11 |
| Range | 2.2276e+11 |
| Interquartile range | 24542 |
Descriptive statistics
| Standard deviation | 6.6109e+09 |
|---|---|
| Coef of variation | 23.91 |
| Kurtosis | 895.5 |
| Mean | 2.7649e+08 |
| MAD | 5.4477e+08 |
| Skewness | 29.359 |
| Sum | 5.2108e+12 |
| Variance | 4.3704e+19 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 2.1e+05 | 9961 | 49.8% | |
| 50000 | 4171 | 20.9% | |
| 9.3e+05 | 614 | 3.1% | |
| 1.8546e+05 | 321 | 1.6% | |
| 4.2e+05 | 276 | 1.4% | |
| 1.2364e+05 | 252 | 1.3% | |
| 3.7092e+05 | 217 | 1.1% | |
| 2.4728e+05 | 157 | 0.8% | |
| 5.5637e+05 | 116 | 0.6% | |
| 7.4183e+05 | 90 | 0.4% | |
| Other values (938) | 2671 | 13.4% | |
| (Missing) | 1154 | 5.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 41213 | 75 | 0.4% | |
| 50000 | 4171 | 20.9% | |
| 51516 | 12 | 0.1% | |
| 61819 | 85 | 0.4% | |
| 82426 | 25 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 2.2276e+11 | 7 | < 0.1% | |
| 1.8943e+11 | 10 | 0.1% | |
| 1.8121e+11 | 2 | < 0.1% | |
| 8.9316e+10 | 3 | < 0.1% | |
| 4.3415e+10 | 11 | 0.1% |
vl_folha_coligados
Highly correlated
This variable is highly correlated with total_filiais_coligados and should be ignored for analysis
| Correlation | 0.91586 |
|---|
vl_folha_coligados_gp
Highly correlated
This variable is highly correlated with vl_folha_coligados and should be ignored for analysis
| Correlation | 0.922 |
|---|
vl_frota
Highly correlated
This variable is highly correlated with qt_socios_pj_inaptos and should be ignored for analysis
| Correlation | 0.9575 |
|---|
vl_idade_maxima_socios_pj
Highly correlated
This variable is highly correlated with idade_media_coligadas_baixadas and should be ignored for analysis
| Correlation | 0.98969 |
|---|
vl_idade_media_socios_pj
Highly correlated
This variable is highly correlated with vl_idade_maxima_socios_pj and should be ignored for analysis
| Correlation | 0.95057 |
|---|
vl_idade_minima_socios_pj
Highly correlated
This variable is highly correlated with vl_idade_media_socios_pj and should be ignored for analysis
| Correlation | 0.91241 |
|---|
vl_potenc_cons_oleo_gas
Numeric
| Distinct count | 31 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 99.2% |
| Missing (n) | 19845 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 14.395 |
|---|---|
| Minimum | 1 |
| Maximum | 360 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| Median | 4 |
| Q3 | 10 |
| 95-th percentile | 69.8 |
| Maximum | 360 |
| Range | 359 |
| Interquartile range | 8 |
Descriptive statistics
| Standard deviation | 36.539 |
|---|---|
| Coef of variation | 2.5382 |
| Kurtosis | 54.772 |
| Mean | 14.395 |
| MAD | 16.731 |
| Skewness | 6.5746 |
| Sum | 2231.3 |
| Variance | 1335.1 |
| Memory size | 952.5 KiB |
| Value | Count | Frequency (%) | |
| 2 | 49 | 0.2% | |
| 1 | 14 | 0.1% | |
| 4 | 14 | 0.1% | |
| 6 | 12 | 0.1% | |
| 8 | 10 | 0.1% | |
| 3 | 9 | < 0.1% | |
| 10 | 7 | < 0.1% | |
| 14 | 5 | < 0.1% | |
| 45 | 5 | < 0.1% | |
| 7 | 4 | < 0.1% | |
| Other values (20) | 26 | 0.1% | |
| (Missing) | 19845 | 99.2% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 14 | 0.1% | |
| 2 | 49 | 0.2% | |
| 3 | 9 | < 0.1% | |
| 4 | 14 | 0.1% | |
| 5 | 2 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 360 | 1 | < 0.1% | |
| 180 | 1 | < 0.1% | |
| 118 | 1 | < 0.1% | |
| 102 | 1 | < 0.1% | |
| 78.3 | 1 | < 0.1% |
vl_total_tancagem
Highly correlated
This variable is highly correlated with vl_potenc_cons_oleo_gas and should be ignored for analysis
| Correlation | 0.99476 |
|---|
vl_total_tancagem_grupo
Highly correlated
This variable is highly correlated with vl_potenc_cons_oleo_gas and should be ignored for analysis
| Correlation | 0.90065 |
|---|
vl_total_veiculos_antt
Highly correlated
This variable is highly correlated with vl_potenc_cons_oleo_gas and should be ignored for analysis
| Correlation | 0.99317 |
|---|
vl_total_veiculos_antt_grupo
Highly correlated
This variable is highly correlated with vl_total_veiculos_antt and should be ignored for analysis
| Correlation | 1 |
|---|
vl_total_veiculos_leves
Highly correlated
This variable is highly correlated with vl_total_veiculos_antt_grupo and should be ignored for analysis
| Correlation | 0.9581 |
|---|
vl_total_veiculos_leves_grupo
Highly correlated
This variable is highly correlated with vl_total_veiculos_antt and should be ignored for analysis
| Correlation | 0.95917 |
|---|
vl_total_veiculos_pesados
Highly correlated
This variable is highly correlated with vl_total_veiculos_antt_grupo and should be ignored for analysis
| Correlation | 0.97758 |
|---|
vl_total_veiculos_pesados_grupo
Highly correlated
This variable is highly correlated with vl_total_veiculos_antt and should be ignored for analysis
| Correlation | 0.97729 |
|---|